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Introduction 


Many of the disciplines associated with theoretical physics and the allied 
engineering subjects rely on variational statements which naturally arise from 
physical considerations or which are judiciously adjusted so as to do so. For 
those disciplines in which the variational statements lead to ordinary differential 
equations, the Hamiltonian formalism and the theory of canonical maps provide 
basic investigatory tools of extreme generality and breadth of application. There 
are, however, no such basic investigatory tools available when the variational 
statements lead to partial differential equations. Thus it would be advantageous 
to investigate the formal structure of partial differential equations which arise 
from variational statements from the standpoint of extending the Hamiltonian 
formalism and the theory of canonical maps where existent. 

There is, at present, little or no distinction made between the assumptions 


which delineate the intrinsic geometry of the space of independent variables — 


and the assumptions employed in constructing variational statements concerning 
collections of functions defined relative to the space of independent variables. 
This situation is particularly surprising in that one need only assume the space 
of independent variables is a Hausdorff space in order to obtain a well defined 
calculus of variations, and hence to construct variational statements. It would 
thus also be advantageous to investigate the formal structure of partial differential 
equations which arise from variational statements, where the space of independent 
variables is assumed to be a Hausdorff space for which there is no intrinsic 
geometry assumed. 

The orientation of this paper is toward an answer to the following question: 
What are the extensions of the Hamiltonian formalism and the theory of canonical 
maps to systems of partial differential equations which arise from variational 
statements, where no assumptions are made concerning the space of independent 
variables other than that it shall form a Hausdorff space? It is realized that a 
complete answer to this question is a task of dismayingly large scope. However, 
it is felt that the principal results are presented from which a complete answer 
may be developed. 

The reader is assumed to be familiar with the general results of Hamiltonian 
formalism and the theory of canonical maps for systems of ordinary differential 
equations (henceforth referred to as the classic theory)!. Such knowledge is not 

' For a concise treatment of the results of the classic theory see DE Wirt, B. S.: 
Rey, Mod Phys. 29,"No, 39377-3384 (1957). For a more exhaustive treatment see 
WuitTaker, E. T.: A Treatise on the Analytical Dynamics of Particles and Rigid 


Bodies, 4th ed. Cambridge, 1937. Wintner, A.: The Analytical Foundations of 
Celestial Mechanics, Chante iearnca ie Brnéeton University Press, 1947. 
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mandatory in that the results and developments given, when specialized to the 
case of one independent variable, constitute a direct development of the classic 
theory insofar as it will be needed for comparison and understanding. In those 
cases in which the results differ markedly from the results of the classic theory, 
the differences will be explicitly demonstrated. When it is felt to be illustrative, 
reference to the corresponding results of the classic theory will be made from 
the original literature. This organization appears preferable to an introductory 
exposition of the pertinent literature in that, first, it enables the reader to draw 
immediate comparisons, and second, there is almost no literature for the case 
of more than one independent variable in which one of the independent variables 
is not given preference over the others through functional representations. 


Section I. Preliminary Considerations 
Notation 


An ordered collection of a finite number of scalars 0' will be referred to as 
the components of a vector b. By an n-vector will be meant a vector with n 
components. 

Let &, be an n-dimensional vector space of variable 2-vectors #. This space 
will be referred to as the space of independent variables. 

A point set J, contained in &, will be referred to as a domain if it is an open, 
connected, nonvacuous point set. We denote by G* the closure of ,. By the 
boundary of Y* shall be meant the set Z*OQ,, where © stands for the set- 
theoretic difference. 

A scalar or vector function of # is said to be of class C” on @,, where y is 
a fixed positive integer, if the function on the domain Q&, under consideration 
is such that all partial derivatives of order y or greater exist and are continuous 
on Y,. 

We shall be concerned with collections of functions of both vector and scalar 
nature defined over Y*. Greek indices shall denote elements of the collections 
of functions considered, and Latin indices shall denote components of vectors 
defined over Y*. Let q,(a) be a typical collection of scalar functions defined 
for all x in Y* and for « an element of .e, where . is a finite set of N elements. 
The set ./ is regarded as the index set of such a collection of functions of x 
and N will be termed the numerosity of the index set 7. Such a collection 
qqa(#) will be denoted by q when there is no possibility of ambiguity. Let q 
and ‘q be two collections of functions defined over Y* with the same index set. 
A map ‘q=‘q(q, £) will be said to be of class C’!if both q and ‘q are of class C” 
and the map has a local inverse everywhere in the Y, under consideration. If 
q and ‘q are considered as imbedded in spaces of functions of class C”, then 
the map ‘\q=‘q(q, @) is said to be of class C"! if it is locally topological. 

Partial differentiation will be denoted by a comma. When the partial deri- 
vative is to be taken with respect to an element of a collection of functions 
defined over Z, or G*, it will be denoted by a comma followed by the symbol 
for the particular element as a subscript (that is, H,.=0H/0q). When diffe- 
rentiation is with respect to a component of a, say x’, it will be denoted by 
(),;- Differentiation of a functions of q and # with respect to x where the q 


Wie 
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are held constant will be denoted by 4; (that is, 0,H = 0H|/éx'|,). Hence, 
H (q(x), x) ; re asi a, i Se 0;H. 
ae A 


Whenever a Latin or Greek index appears twice in a term or product, sum- 
mation is implied. Whenever summation is implied on a Greek index, the sum- 
mation is to be extended throughout the index set 7. The situation in which 
an index appears twice but is not to be summed will be denoted by x with 


ce 


the index over which summation is not to be extended beneath it (that is, “no 
summation on «” is written >). 


Column matrices will be “denoted by {}, row matrices by [], and square 
matrices by A, B,.... Denote by E the N xN element identity matrix and by 
E* the N(n+1) xN(n-+1) element identity matrix. All matrices considered will 
be either defined over or relative to the collections of functions defined over 
G, or BF. 

Let {u} be an S-element column matrix and {v(w)} an S-element column 
matrix each of whose elements is a function of the elements of {w}. By {v} 2 
we shall mean the square matrix defined by 


ef a 4 
£0}. guy = W = (Weg) = (800/00). 
If f(u) is a scalar function whose arguments are the elements of {uw}, then by 
f guy we shall mean the row matrix defined by 


def 
oe = [4,] — [07/6 ,] S 
Similarly, {f ,} is a column matrix defined by 
def , 
Wh sf — (f {u}) 


where “‘prime’”’ denotes the transpose. 


Vectrices and Vectrix Algebra 


The analyses and proofs involved in the study of the invariance group asso- 
ciated with Hamiltonian systems of partial differential equations becomes ex- 
tremely cumbersome without the introduction of what we refer to as vectrices 
and their associated algebra. Loosely speaking, vectrices are ordered collections 
of matrices, where the ordering is similar to that used in the definition of vectors 
as ordered collections of scalars. As might be expected from the similarity of 
ordering between vectrices and vectors, the algebra of vectrices will have many 
properties in common with the algebra of vectors. 

Let X,, be a Euclidean space of 1 dimensions which is independent of the 
space &, of variable m-vectors a. By an ordering basis will be meant a fixed 
collection of constant orthonormal bases vectors é,, é9,..., é, Which span X,,. 
The notation ¢; rather than e, is used to note specifically the fact that the 2; 
are subsidiary quantities which are independent of any algebraic processes of 
either vector or scalar nature in &,. 

Definition. Let A; (t(=1,...,”) be n arbitrary matrices each of which has 
the same number of rows and columns and whose elements are defined over 


or relative to &,. By a vectrix A will be meant the collection of matrices A; 
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ordered by an ordering basis @1, é2,..., @,, aS defined by the equation 
A = 2; A;. 


The matrices A, will be referred to as the components of the vectrix A. It 
must be clearly borne in mind that the @; are an ordering basis constructed from 
the subsidiary space X,, and do not operate algebraically with the component 
matrices A, (7.e., ¢; A; is not to be interpreted in the sense of a product between 
a vector and a matrix). This condition is easily remembered since the @; are 
defined from the subsidiary space X,, while all the elements of the component 
matrices A; are functions defined over or relative to the space &, of variable 
n-vectors. For instance, @;E is a vectrix whose first component is the identity 
matrix, and whose remaining components are NXxN null matrices, ¢) E is a 
vectrix whose second component is the identity matrix and whose remaining 
components are N x N null matrices, etc. 

Definition. Let A= e, A; and Be, B, be two arbitrary vectrices whose com- 
ponents have the same number of rows and columns. The operations of addition, 
multiplication by a matrix M, transposition, and inner multiplication are 


defined by a LB i det > Sel eeny 
MA“¢MA,, AM“2,4,M 
A A 
AoB< A,B, 


respectively, where MA, and A; B; are multiplied by the laws of matrix multi- 
plication. 

Definition. Two vectrices A=e; A; and B=¢; B; are said to be equal if and 
only if the component matrix A; is equal to the component matrix B; for all 7 


(7.e., equal component by component). 
From the above definitions, it is evident that vectrices admit the following 


algebraic properties : Fea Fe A 
A+(B+C)=(A+B)+C0 
A+B=A+C=>B=C 


M(A+B)=MA+MB 
(M+N)A=MA+NA 
(MN) A= M(NA) 


MA=AM 
if and only if M commutes with A, for all 7 
(M.A)! = A’ M’ 


AeMB=AMeB 
(A+ B)eC=AeC+ Be 


> 


(Ae B)'=B'eA’. 
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To complete the above results, we define the null vectrix, O, as a vectrix all 
of whose components are null matrices. We shall refer to vectrices as column, 
row or square vectrices if their component matrices are column, row or square 
matrices respectively. 

A specific class of vectrices which have central importance in the analyses 
to follow are those vectrices referred to as structure vectrices. 


Definition. A vectrix A is said to be a structure vectrix if and only if 


(1) Ais a constant square vectrix (1.1) 
cies eer (1.2) 
(iii) det |4e A] £0. (1.3) 


Definition. The right and left-hand inverse structure vectrices corre- 


sponding to a structure vectrix A are defined by 


(i (477) =— Ay (1.4) 
(ii) Ave A=E*=AcA (1.5) 
(iii) A= CA. (1.6) 


Theorem 1.1. The right and left-hand inverse structure vectrices corresponding 


is 
to a structure vectrix A are unique and are given by 


Ay} = (A ® ) 14 
; ‘ (1.7) 
A;'= A(Ae A)1, 


Proof. Substituting equation (1.6) into (1.5) gives 


E* = A e¢A=CAeA, 
from which we obtain 
C=(Ae A 
whose existence is assured by equation (1.3) since A is a structure vectrix by 
hypothesis. Thus, by equation (1.6) Apt= (Ae A)11A, from which the result 
A=A(Ae A)2 follows by equation (4.2) and (1.4). Q.E.D 
Given the right and left-hand inverse structure vectrices corresponding to 


a structure vectrix A we may construct collections of vectrices with the properties 
(i) BeA=E* 
(ii) AeF=E*. 


Specifically, let C, and C, be any two vectrices with the properties C,eA = 0 
and Ae C, = O, respectively; then upon setting 
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we have BeA=(A46)eA=ApeA+ CoA 
=A oA OX Re. 


The following theorem establishes the basis for many of the results in the 
sections to follow. 


Theorem 1.2. Let A and B be a given structure vectrix and a matrix respectively, 


and let F be a vectrix which ts to satisfy 
AeF=B. (1.8) 
Then the most general form which F can assume is 
FAIALBH Gi Ge =O (1.9) 
(1-2. F is determined by equation (1.8) only to within an arbitrary vectrix whose 


inner product with A, from the right, gives the null matrix). 


Proof. Multiplying equation (1.8) from the left by A} gives 


A,}\(4eF) =A) BEG 
Hence 
AeG=(AeA-)) (AeF)=AeF, 
so that 


and hence F—G=C where AeC—O. Thus, since G=AB we obtain = 
A;'B-+C where C is an arbitrary vectrix to within the condition deC=O. 
OrEsB): 

We shall often find it convenient to represent a vectrix as a matrix whose 
elements are linear functions of the ordering base. For example, 


=> —_> —& 
Ci Ciao © 2e3 
> > — a 
es €3 —eé,1/=B 
— => =e 
3 ey 4éo 


represents the vectrix whose component matrices are given by 


Need 0) O19 80 OO 
Cialic ui bal I rere, PON Wand "101° 6 
Oar leak Chenille! Ae erO 


respectively; that is 


ss jae: erat) Or 4-1K0 0. One? 
B=2410 0—-1}l+el1i 0 O}]+46{0 1 0 
kale Lian Gregg 1400 


Situations will arise in Sections IV and VII in which summations of the 
form A,x' will appear, where A; are the components of a vectrix and x’ are the 
components of an m-vector. Since both vectrices and m-vectors are ordered in 
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a similar manner, we may introduce the following notation: 

AQa “A, x. (1.10) 
Since vectrices are ordered relative to an ordering base obtained from the sub- 
sidiary space X,,, it is evident that the order of terms in the ®) product, as defined 
by equation (1.10), is immaterial; that is 


oa 


AQu=2QA. 


Local and Non-Local Questions 


To this point, we have made no specific statements concerning the structure 
of &, other than it is a space of ordered collections of scalars, which in no way 
implies any sort of structure for &, other than that for which the algebra of 
its elements is a linear algebra. We now proceed to delineate the postulates of 
structure for &, which shall be implied in the remainder of this work. 

By a space &, we shall mean a set of ordered collections of scalars, called 
points, for which one can define a system of sub-sets called neighborhoods, 
satisfying the following conditions: 

(i) The points of each neighborhood can be put into a one to one reciprocal 
correspondence with the interior points of a hypersphere of the Euclidean space 
of m dimensions. 


(ii) Each point of &, belongs to at least one neighborhood. 


(iii) For any two neighborhoods of any given point of 6, their exists a neigh- 
borhood which is contained in the intersection of the two given neighborhoods. 


(iv) If a point b is contained in a neighborhood U(a), of a point a, then there 
exists a neighborhood V(b) which is contained in U(a). 


(v) If a and b are any two points of &,, then there exist neighborhoods U(a) 


and V(b), of a and b respectively, such that U(a) and V(b) have no points in 
common. 


Simply stated, we require &, to be a Hausdorff? space. 


In addition to the above structure, the &, considered may have a well defined 
differential geometry for a particular problem under consideration. Whether 
this is the case or not is a moot question relative to the considerations of this 
study. We shall require no further knowledge of the &, other than its satis- 
faction of postulates (i) through (v) above. 

Let f;(a#) (t=1, ..., m) be an ordered collection of functions of class C! defined 
over a given YF of &,, and consider the quantity .% defined by 


I= f,,44,, (1.11) 
Di 
where dv, stands for [] dx’. Using the ordering basis ¢;,...,2,, we may represent 
i=1 
/; as the components of a vectrix whose component matrices are the 1 <1 matrices 
ibe that is aad as 
» WILDER, L. R.: Topology of Manifolds. Amer. Math. Soc. Colloquium Publ. 
No. 32 (1949). 
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Introducing the vectrix operator V, as defined by 


equation (1.11) takes the form 


I =f Vefdr,. (1.13) 


We may, with no loss of generality, view equation (1.13) as resulting from im- 
bedding &, in X,, by identifying every point r=(x1,..., x”) of Z, with a point 
in X, with the coordinates (x1,..., x”), since &, and X,, satisfy postulates (i) 
through (v) and hence are topologically equivalent. 

We assume at this point and henceforth that the boundary of G* is rectifiable. 
Under this assumption, and noting that both &, and X, are assumed to be 
Hausdorff spaces, the divergence theorem is applicable. Thus 


F=f[Vefdr,,= [ Yfde...deé de ...dx". (1.14) 


Gn GnOGn * 
Introducing the symbolic vectrix NdSs by the equation 
NdS=2,NdS=2,dx1...dx' 1 dx'*)_.. dx", (1.15) 
equation (1.14) takes the symbolic form 


I =fVefdv,— f feNds. (1.16) 


Using only the topological structure of &, and X,, and the ordering base, we have 
obtained the global result stated in equation (1.16). Since equation (1.16) is 
the basis for obtaining conservation laws, we shall be able to obtain conservation 
laws for the systems of equations studied without requiring any knowledge of 
a differential geometry for the space &,. Although this result is an obvious 
one in that the divergence theorem is a topological theorem, it will be of con- 
siderable moment in the second of this series of papers. For this reason the 
above detailed discussion is included. 


Section II. Canonical Maps and their Fundamental Characterization 


Hamiltonian Systems and Canonical Maps 


Let q,(a) be a collection of scalar functions of class C? and $,;(x) be a col- 
lection of m-vector functions of class C! for all a in some given G*, where « ranges 
over a finite index set -M of numerosity N. Denote such a collection of functions 
by (p,q). Let be the space of all functions of class C? in their N+-nN+n 
arguments q,, ~,;, £ The space § will be referred to as the space of admissible 
bases functions. A collection of functions (f, q) is said to form a Hamiltonian 
system base H in &;** if (pf, q) satisfy the system of partial differential equations 


buii=—Aan\ eg (2.1) 
qxi=H ; 


Py 
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for all w in G*. The above equations are the partial differential form of the 
celebrated Hamiltonian equations which were introduced by HAMILTON? in 1834, 
as will be shown in Section III. 

Let J be the collection of all topological maps of class CH in (f, q) and of 
class C? in a for all # in 9* 


FT: (Pai= Pui (Laz, Qs,@); da= da (Pp; Qs, 2)). (2.2) 


If equations (2.1) be written in terms of the new variables (P, Q) for an arbitrary 
element of the collection 7, the form of equations (2.1) will not, in general, be 
preserved. The concern of this work will be the determination of the subcollection 
of Z for which the form of equations (2.1) is preserved and the properties of 
this subcollection. 

Definition. A map T of the collection ZF will be said to be canonical if 
and only if there exists, for every H(f,q,x)€H, a K(P, Q, #)€ such that 
if (p, q) form a Hamiltonian system base H in G*, then (P, Q) form a Hamil- 
tonian system base K in G*. 

From this definition we have the following statement: The form of equations 
(2.14) is invariant under the collection of all canonical maps. It should be noted 
that there exist maps of the collection ZY for which the form of equations (2.1) 
is invariant for a particular HE § but not for all HES. 

Examining the definition of a canonical map in detail, we see that T is a 
canonical map if and only if, for every HE, there exists a KE H with the 
property: if (f, q) form a Hamiltonian system base H, then (P,Q) form a Hamil- 
tonian system base K. Let §» be the subcollection of whose elements, Hy, 
are such that there exist no (p,q) which form Hamiltonian systems base Hy 
(that is, there exist no solutions for (p, q) to equations (2.1) for Hy and element 
of po). Denote by .7, the collection of all elements JT, of ZF which are such 
that for every element of $9, there exists a K€ with the property: 7f (A, q) 
form a Hamiltonian system base H)€ ,, then (P, Q) form a Hamiltonian system 
base K. Since there are no (f, q) which form Hamiltonian systems base Hy€ Ho 
by the definition of ,, the conditional statement in the above definition of J 
is vacuous, and hence we may pick any K€ § and the condition will be satisfied. 
Hence there are no restrictions on Z,, so that 7% is identical with 7. Now, 
let F, be the subcollection of 7 whose elements are such that for all elements 
of ) ON, there exists a KE with the property: 7f (p, q) form a Hamiltonian 
system base HE} OH», then (P, Q) form a Hamiltonian system base K, and 
let 7, be the collection of all canonical maps. From the definition of a canonical 
map, we have 7,= 7,07, since $=(9 OH») UH»). We have shown, however, 
that 7 is identical with 7 so that 7, is contained in Y, from whence 7,07, =7, 
and hence 7,=7,. It is also evident, for all elements of 7,, that if HEH She, 
then K must be an element of Oy, since the conditional statement in the 
definition of 7, requires (P,Q) to form a Hamiltonian system base K. Thus 
we have the following equivalent reduced form of the definition of a canonical map. 


Definition. A map T of the collection 7 will be said to be canonical if and 
only if there exists, for every HEH OH), a KE} OH, such that if (p,q) form 


° HamILton, W. R.: Phil. Transactions 95 (1835). 
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a Hamiltonian system base H in Y*, then (P,Q) form a Hamiltonian system 
base K in 9*. 


Before using the above definition to characterize the collection of canonical 
maps analytically, we proceed to introduce matrix notations which will be of 
great facility in the application of the above definition and in subsequent work. 


Define the vectrices E, and E, by 

E,=|%E%E...2,E];  E,=— Es, 
where E is the N XN identity matrix and the prime denotes transposition. Let 
O, and O, be NXN and nN xmnWN null vectrices respectively and define the 


constant vectrix I by 


=e Om 
=(° =) (2.3) 
E, O, 
so that 
O, |&E GE... ¢,E 
> | —ZE 

as O 
| 2 


From the definition of I it is evident that I satisfies the conditions required of 
a structure vectrix; that is, 


(1) Tis a constant square vectrix 
(i gum aces! (2.4) 


(iii) det |Le I) =(—1)%"*) nw +0. 


By Theorem 1.1 we may construct the right and left-hand inverse structure 


vectrices corresponding to I which have the property 


> > 


Tiel=E*; [el =E*. (2.5) 
By the same theorem, we have 


Ip =—E=—()T (2.6) 
where by equation (1.6) 


Expanding, we have 


eBLEO O 
nN 
EO p 
(y)= are 5; (2.7) 
x | we O 
O OE 
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so that . 
Me —@é@E ... =¢,F 


¥ —> 


eal 0, 
, E | 
zm 


Care must be exercised in the use of the right and left inverse vectrices. 


instance if A and B are vectrices which are related by the equation 
I(I\*A)=B 
for given Al then 


Tel (ie A)=I'eB 
gives 


TieA=T;'0B 


For 


which would appear to imply that A=B. This is not the case in general as 
shown in Theorem 1.2. A specific result which will be needed is as follows: 


if SKY and sr are column vectrices where £K} is given, then 1(I;1{K}) 
implies sh =tKt only if 


Let {R} be the N(m+1) element column matrix defined by 


{dat 
{Kk} = _ is tba 


ea n 
and let V denote the operator defined by 


so that 
{qa, i 
VER} =e, Wibod 
{Pun id 


is a well defined vectrix. 


={L} 


With the above matrix representations, the Hamiltonian equations (equa- 


tions (2.4)) are given by _, 
TeV {R} + {H3 — 00} 


(2.12) 
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(that is, 
Levees Fags 
{— da, i} ate {H, past — fo} ) 


te du, nf {7 we 


as seen by comparison with equations (2.1). 


It should be noted that vectrices may be equally well used to represent 
tensorial equations or non-tensorial equations. To illustrate the method we 
consider the following cases. 


Case 1. Let I’ and @ be a contravariant tensor of of rank one and a scalar 
density, respectively, which satisfy the equations 


i=0; 9,,=0 


where (;) is used to represent the covariant derivative. Expanding the indicated 
covariant derivatives, assuming that the underlying metric space is a Riemannian 
space with affine connection J;,, we have 


Pithjl=0, 9.—Tiy=0 


which can be represented as follows: 


Y SS Sk 

Bl ToV{RE+ {Ht ={0f, 
WO || ete 

: =) oe 

IEE 


Case 2. Let I’ and gm be a contravariant tensor of rank one and a scalar, 
respectively, which satisfy the system of equations 


i = be 
Li=P, Pr=uil’; 


then we may represent them by 


ns 


Pl To V*{R} + {Hg} = {0} 

yl 1 a 77 dl at) 

ie H=>¢,71 D3 v 
I" V 


Alpha Systems 
The systems of partial differential equations thus far considered (Hamil- 
tonian Systems) may be generalized to a larger class as follows: A column matrix 
{R} is said to form an alpha system with structure vectrix @ in a given J* 
if {R} satisfies, in Y*, the equation 


V {RY 0G + H gry = [0] (2.13) 
Or 


a’ V{R} + {H ,} = {0}. (2.13 b) 
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Many of the theorems to follow use only the properties of I which are the same 
as the properties of structure vectrices and do not depend on the particular 


form of I. Thus, if we replace the term “Hamiltonian system” in such theorems 


by the term “alpha system’ and replace I and its right and left-hand inverse 
vectrices by the structure vectrix & and its associated right and left-hand inverse 
structure vectrices, the theorems will carry over directly. We shall mark all 
theorems or parts of theorems which carry over directly to alpha systems by 
asterisks. 

As will be seen, the structure vectrix of an alpha system effectively charac- 
terizes a majority of the formal properties of such a system. Thus, the reduction 
of systems of partial differential equations to alpha systems and thereby defining 
the structure vectrices for such systems gives a convenient method of catagorizing 
such systems as to their formal properties. In this sense equations (2.13a) or 
(2.13b) may be thought of as defining a canonical form. 


Statement of the Problem 
Consider a map T of the collection 7 (i.e.: T:(p, q)—>(P, Q)) which may 


be represented as T {R= RY. 
By the associated Jacobian matrix of T will be meant the matrix 
M = {R} zy (2.14) 
which may be written in the alternate form 
A B 
M= 245 
i a car 
where A, defined by 
A = (4x, 05) ) (2.16) 
is an N x N matrix; B, defined by 
B= [(45, 7% 4) aos (qu, 73,)]; (2.17) 
isan NxnWN matrix; C, defined by 
Ci [but 09) wee Pin 03)'] , (2.18) 


is an mN x N matrix; and D, defined by 


(Px1, P31) basis ()x1, 7%») 
Dai) ee aa (2.19) 


, (Pan, P, 7) Oa (Px n, Py) 
is an wN x nN matrix. 

We are now in a position to develop the analytic characterization of canonical 
maps. Consider the forms which the terms in equation (2.12) take as the result 
of a map of the collection .7. Examining the second term of equation (242)5 
we have, for an arbitrary element of 7, upon taking the transpose 


AL ery = FL gy {Rh cry; 
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but ~ 
{R} gx3 = M1 


{7 rt = (A gry)’ 


{7 nf = M(H 5}. 


et V be the vectrix operator defined by 


and 


so that 


V2 2.a,(). (2.20) 
The quantity VAR will be interpreted as the N(m+1) element column vectrix 


obtained if one evaluates VER | (R, a) and then expresses R in terms of Randa 


by the transformation equations. Considering the first term of equation (2.12), 
we have under T 


VERY = {RY cy VERY + VARY 
so that 
V{R} = M VER} — M VER}. 
Hence, the left member of equation (2.12) becomes under T 
To VER} + {Hp} = MY {MIM o (VERY — VERY) + {Ha}, (2.21) 
upon noting that 


IeMV=IMerp. 


Since the left-hand side of the above equation vanishes by equation (2.12), if 
(p, q) form a Hamiltonian system and M~™’ is nonsingular for all a in >, we have 


M'IMe (VERY -- V«R}) + {Hz} = {0}. (2.22) 


By definition, 7, which results in equation (2.22), is a canonical map if and 
only if there exists, for every HE O Hy, a KE H O Hp such that (P, Q) forma 
Hamiltonian system base K in 9X. Stating that (P, Q) form a Hamiltonian 
system base K in Q* is equivalent to stating that (P, Q) satisfy 


Te VER} +{K g}= fo} (2.23) 


by equation (2.12). We are thus faced with the problem of establishing what 
conditions must be placed on a map T of the collection Y such that equation 
(2.21) or (2.22) reduces to equation (2.23) whenever (, q) form a Hamiltonian 
system base H for all HED — Hp. 


The Fundamental Theorem 


The arguments leading to the establishment of the required conditions in 
order for a map to be canonical are somewhat lengthy and involved. For this 
reason we give, at this point, a cursory outline of the procedure so that the 
forest may not be lost for the trees. 
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We first reformulate the problem in terms of operators over matrix collections 
of function spaces and interpret these operators as inducing mappings to other 
matrix collections of function spaces. We then fix the basis function and formulate 
in Lemma 2.1 the conditions that equation (2.21) reduces to equation (2.23) in 
terms of the kernels of the maps defined over the function spaces considered. 
We then obtain in Lemma 2.2 necessary and sufficient conditions that the con- 
ditions of Lemma 2.1 be satisfied in terms of the existence of subsidiary maps 
on the operators considered. These two lemmas then allow us to state the neces- 
sary and sufficient conditions that equation (2.21) is equivalent to equation 
(2.23) for fixed H, Lemma 2.3 and Theorem 2.2. The result thus obtained is 
then required to hold for all basis functions of the collection ) © from which 
we obtain the fundamental theorem (Theorem 2.2) which analytically charac- 
terizes the most general canonical map. 

Denote by t the space of all N(m+1) element column matrices {R(ax)} of 
class C1 for all 2 in some given Y* and by & the space of all N(n+1) element 
column matrices {s(a)} of class C for all # in Y*. Under the assumed continuity 
of the elements of St we may construct a continuous matrix operator {Dy} over 
jh ae 
yt, defined by Oy} (7) act Te Pike} 

fYER, HEHOL, xEeQ* 


which assigns to every element of 3 a new element which is at least of class C. 
We consider this operator as a continuous mapping of Xt into GS: 


(2.24) 


LOut: R > Sy cS. 


Denote by oy the kernel of the above map which is assumed to be nonvoid 
(t.e., LED Oy) 
{Ont t On > {Pf (2.25) 


O 
where {®} is the zero element of S defined by {o| . Let T be a map of the 
collection 7 so that 0 


T:R> MR. (2.26) 
Under such a map we have seen that the operator {O,,} is transformed by the 


equation : a 
T: {Oy} > {On} 


where 
a def ji Ea aa = a 
{Ou} (1) — M1'{M'I Me (V ft — Vi) +{H; INen (2.27) 
(ie, FOV AR} + {Hg} = M1 {M' I Me (V {R} — 7 {R}) + LH ay}). 
Since T is the clement of the collection 7, the elements of R are of class C!, 
and hence {©,,} may be interpreted as mapping NR into S: 
{Oy}: KM“ Ey cS. 
Let 04 be the image of oy, under T, so that 
{Ox} On 5 {PD}. 
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We define a new operator {07} by 


{D8} (u) S M'{On} (u) (2.28a) 
(i.e., 


{98} («) =’ I Mo (V {u} — V {u}) + {H,} (2.28b) 


as is seen from equation (2.27) upon noting that M is nonsingular by hypothesis). 
The operators {D,} and {Of} both have the same kernel, namely 6,, when 


considered as mapping i into ©, as is immediate from equations (2.27) and 
(2.28) upon noting again that M is nonsingular. 


Consider now an operator {Dx} defined over ni by 
{Dich (w) SLOP +{K 3, (ER, KEHOH. — (2.29) 
Since K is assumed to be an element of ©, and the elements of HR are of 
class C!, {Dx} may be considered as a continuous map of & into ©: 


{Ox}: KAS Gx cS. (2.30) 
Denote by gx the kernel of {Ox} so that 


From the form of equation (2.29) we see that the elements of 0, form Hamil- 
tonian system base K in R. Similarly, from the definition of 6, (that is, 6, is 
the image of the kernel of {O,}, in jt, under T) and equation (2.24) it is evident 
that the elements of 6, are the images, under 7, of those elements of St which 
form Hamiltonian systems base H. 

Let H be a fixed element of HOH). Hf, for some KE H O Ho, Oy is contained 
in 0x, then T is such that it maps every point of }t which forms a Hamiltonian 
system base H into a point in Jt which forms a Hamiltonian system base K 
(that is, 6, contained in @x implies that for every {uw} an element of 6,,, {Dx}(u) = 
{®}, and hence {uv} forms a Hamiltonian system base K). Conversely, if T 
is to be such that it maps every point of }t which forms a Hamiltonian system 
base H into a point in jt which forms a Hamiltonian system base K, then K 
must be such that {Dx} maps 6, into {®} and hence 6,, must be contained in ox. 
We have thus proved the following 

*Lemma 2.1. A necessary and sufficient condition that a map T of the col- 
lection J map all points of R which form Hamiltonian systems base H, for fixed 
HES OD, into Hamiltonian systems in Rt is that there exist at least one function 
KEH OD, defined over Rt, such that Oy ts contained in Ox. 

Our problem is thus reduced to seeking conditions such that @, is contained 
in 0x for some KE HOH). Let A(R, x) be an N(n+1) xN(n+1) element non- 
singular matrix all of whose elements are of class C1 for all x in Y* and for all 
{RY in St. For each such matrix we may define a new operator 

{Ox at = A {Ox (2.32) 
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where the indicated multiplication is matrix multiplication. Under the continuity 
assumption on A we may interpret {Ox jy as a map of # into © 
{Dx}: R——> Sai cS. (2.33) 
Combining equations (2.30) and (2.32), we have 
{Oh () ={FESKES, FER, 
{Dish (7) =A{OKP() HALE SKACS, 


and hence the matrix multiplication of {D,} by A may be considered to induce 
a transformation of Gx to Sx, as defined by 


A{s$ = {Sp {spC Sx; {s}€ Sx, 
SK — - {s,} . 
We shall refer to such a transformation induced on Sy by equations (2.32) and 


(2.33) as a lambda transformation and the matrix A as a lambda matrix. 
The collection of all lambda matrices we shall denote as Q. 


Let Mx, be the set of all elements {m} of J which satisfy 
{Duh (m) = {s} = A{Ox} (m) = {Ox jf (m) 


for A an element of 4 and K an element of ©, . For simplicity, we shall 
denote the set Wx, and its defining relation by 


{Ot = {Dx}. (2.34) 


Mk 
By equation (2.28a) we have 
M+ {03} = {Ox}, 
R 
and hence equation (2.34) is equivalent to 
j/f[nx ran 
M 1 {O15 gp MIOKE 
since Wik, is contained in NR. Noting that Mis nonsingular by definition, we have 
* 7 a a 
{Or Sa UL A{Ox}. 


Thus, we consider the set IN~, defined by 
ok 
{On iy {Ox} . (2.335) 


Let {u} be an element of 6, and in addition an element of IR*, for some 
ACA, say Ay. The assumption that {wv} is an element of 6,, implies that {OF} (uw) = 
{P}; and {w} an element of I, implies, by equation (2.35), that {D} =A LOx} (uv). 
If {s,} is the image, in S, of {uv} under {Ox} so that {Ox} (uw) = {sx}, then the 
consideration immediately preceding implies that {®}=A {s,% and hence that 
{vf is an element of gx, since Ay is nonsingular by hypothesis. Thus, a sufficient 
condition that 0, be contained in ox is that there exists at least one lambda 
matrix, say A), such that 0, is contained in N* 4, Conversely, if 6, is contained 
In ox, then for any {wj an element of 0, we have {D7} (w) = {Od and {Dx} (w) {or 
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Hence, equation (2.35) is satisfied for {w}¢@, and for any ACA so that 6, is 
contained in IN&,. We have thus proved 


*Lemma 2.2. A necessary and sufficient condition for Og to be contained in 
0x 1s that there exist at least one lambda matrix A, C A such that the set M#, , defined 
* . PS 0 
by {Ont iii, Ao {Oxp, contains 6,7. 
Combining the above two lemmas, we have 


*Lemma 2.3. A necessary and sufficient condition that a map T of the collection 

F map all points of RX which form Hamiltonian systems base H, for H a fixed 

element of SOH_, nto Hamiltonian systems in Kt is that there exist at least one 

function KE) ON , defined over K, and at least one lambda matrix Ay such that 
the set of points M*%,, defined by 

{On a An Oi ts (2.35) 


Mx A, 
shall contain @,. If these conditions are satisfied, then K is the new base function 
corresponding to H under T. 

This lemma is the basis for the investigation of the conditions which must 
be placed on a map T in order for it to be canonical. Using equations (2.28b), 
(2.29) and (2.35) and the above lemma, we see that a necessary and sufficient 
condition that JZ map Hamiltonian systems base H, for fixed H, into Hamil- 
tonian systems is that there exist at least one function K and at least one lambda 
matrix such that 


M'IM « (V {8} — VER}) +H =m (LEV ER +H{K DB) (230) 


shall hold for all points IK ,, which are elements of jt where IME, is such that 
it contains 6,. At this point we must distinguish between two cases. 


Case 1. The point set 6, forms at least a one parameter continuum. In this 
case, the requirement that equation (2.36) holds for all points of a set NK a, Rt 
and that NM, contains @,, implies that we require equation (2.36) to be satisfied 
over 4 continuum of points of ht. This can only be the case if the coefficients 
of the various derivatives of £RY in equation (2.36) vanish separately. 

Case 2. The point set 6, is composed of a finite number of elements. In 
this case we proceed as follows. Since it is required that equation (2.36) hold 
for all points of a set IN, which contains @,, we require MK,, to be at least 
a one-parameter continuum containing 6,. For this choice, equation (2.36) 
can only hold over such a continuum if the coefficients of the various derivatives 


of ER} vanish separately. It must be noted that in this case we are actually 
placing added conditions on the problem since there can exist functions K for 
which equation (2.36) is only satisfied for a set of points of which are finite 
in number and contain 6,. Thus, in this case, the following analysis will only 
result in sufficient conditions rather than both necessary and sufficient conditions. 


The requirement that the coefficients of the various derivatives of {RY vanish 
separately results in WIM=2XI (2.37) 
ALK a} = {Hg} — MIM / {RY}, (2.38) 


8* 
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since V is not a differential operator but assigns a function of (R,a) as seen 
0 


from its definition (equation (2.20)). Using equation (2.37) in equation (2.38) 
gives, upon multiplication on the left by 47, 


{K a} = 21H a} — To PER}. (2.39) 


Examining equation (2.39), we see that it is a first order differential equation 
for the determination of K. 

A necessary and sufficient condition for the existence of a general solution 
to equation (2.39) is that the matrix J={K jz} ¢z be symmetric, since all terms 
in equation (2.39) are of class Cl. Setting A += ym, we have, since I is a constant 
vectrix, 

I= (ntl eh), cay — Te (VARY) cs. 
But 
(7&R}), ea = (PM), 
and hence 
I= (n {Fl xh), ci — (Le VM) M. 


The integrability conditions of equation (2.39) thus read: 


O = (n{# x), c& — [(n{93), cay)’ — le 7M) M+ MMe 
which, since i 1 gives 
O = (nf a}), cay — (nH a) a) — (oe PM — We. 
Set 
v= — (Ie 7 M7) M—M' 7M 'eI, 
0 
then 


M2’ » M4 = — M2’ le VM2— (VM>)'eI M2, 
0 0 
=— Voi IM»), 
From equation (2.37) we have 4M’ I— IM where y=A1, and hence 


M+’ IM+ = M1'yM’T 
so that 
CaN V (M2’yM’)) oIM. 
Thus, 


O= (n{4 a), <9 — [(n {4 a3), <8)’ — MV (M2 M)] eIM (2.40) 


are the integrability conditions for equation (2.39). Since H and M are given 
(that is, M satisfies equation (2.37)), equation (2.40) is a partial differential 


equation for the determination of y(R, a). We have thus proved 
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*Theorem 2.1. A necessary and sufficient condition in the case in which Oy 
forms a continuum, and a sufficient condition in the case in which 64 is composed 
of only a finite number of elements, for a map T of the collection J to map a Hamiil- 
tonian systems base H, for fixed HES OD,y, into a Hamiltonian system, is that 
(1) the Jacobian matrix of the transformation satisfies equation (2.37) and (2) 
that there exists a solution 4 (R, ax) to equation (2.40). In addition, if the above 
conditions are satisfied, then equation (2.39) is completely integrable and determines 
the new basis function K. 


By definition, a map of the collection 7 is canonical if and only if it maps 
all Hamiltonian systems formed from all bases functions into Hamiltonian 
systems; or equivalently, it maps all Hamiltonian systems formed from all 
bases functions of the collection ©, into Hamiltonian systems. In order 
to apply the results of Theorem 2.1 to characterize canonical maps, we must 
distinguish between those Hamiltonian systems whose solutions form at least 
a one-parameter continuum and those Hamiltonian systems which admit only 
a finite number of solutions. 


Denote by , the sub-collection of § ©,» which are such that the Hamil- 
tonian systems formed with basis functions which are members of this sub- 
collection admit at least a one parameter continuum of solutions, and by , 
the sub-collection of O, for which the Hamiltonian systems formed from 
this sub-collection of basis functions admit only a finite number of solutions. 
Since all Hamiltonian systems formed from basis functions of the collection 
§ © Hp are such that they admit at least one solution and the conditions defining 
§, and §, are mutually exclusive possibilities, we see that the set-theoretic 
union of §, and $, contains and is contained in Op, and that the inter- 
section of $, and , is empty. Let 7, be the collection of all maps T of the 
collection Y which map Hamiltonian systems formed from basis functions of 
the sub-collection , into Hamiltonian systems, 7, the collection of all maps T 
of the collection 7 which map Hamiltonian systems formed from basis functions 
of the sub-collection $, into Hamiltonian systems, and 7, the collection of all 
canonical maps. From the definition of a canonical map, we see that 7, is 
equal to the intersection of 7, and 7,. Since Theorem 2.1 establishes necessary 
and sufficient conditions fora map to be an element of the collection 7,, anda 
sufficient condition for a map to be of the collection -7,, for given H; and since 
7, is the intersection of 7, and 7,; we see that a map T is canonical if and 
only if conditions (1) and (2) of Theorem 2.1 hold for all H contained in Op. 

Examining condition (1) of Theorem 2.1, we see that M'IM=XI must be 
satisfied for all H which are elements of ©, simultaneously. If T is to map 
all Hamiltonian systems simultaneously into Hamiltonian systems, then it must 
be independent of the base function for any particular system, and hence the 
Jacobian matrix of the transformation must be independent of H. This can 
only be the case if A is independent of H, since we require the associated Jacobian 
matrix of a canonical map to satisfy equation (2.48). 

Condition (2) of Theorem 2.1 requires that there exist at least one lambda 
matrix A=y + which satisfies equation (2.40). Since equation (2.40) must be 
satisfied simultaneously for all H which are elements of © , and in addition 
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it is required that A must be independent of H, equation (2.40) must be an 
identity in H. This can be true only if the coefficients of H and its derivatives 


vanish separately in equation (2.40). Thus 


AA Ry cy — (AeA = 9, (2.41) 
Aga AH x} — {Hay (AGH) =O, (2.42) 
M’(V (M’2M’)]eIM=O (2.43) 

0 


for all H contained in §OQ,. Since equation (2.41) must hold for all HE HOD , 
then it must be satisfied by an H such that {H x} ck} — E*. For this case we have 
A+1—A1'=O so that A+ must be symmetric. Rewriting equation (2.41) to 
reflect this result, we obtain 


ALA Ry, c@ — {RE AT =O. 
Since the only matrix which commutes with all matrices is a multiple of the 
identity matrix, we may conclude that A? must have the general form A t= 


ll (R, a) E*, where is a scalar function. Substituting the form of A thus far 
obtained into equation (2.42) gives 


{1,3 LE ag’ — Lt ues = 0, 
which can only be satisfied for all HE OS, and for A independent of H by 
{ux} = {O}. Thus A+ must have the form A+t=w(ax)E*. Substituting into 
equation (2.43) gives > —> 
: (2-43) 8 M' (Vu(e)] 1M = 0. 
Since M is nonsingular by hypothesis, we have 
Vu (x) e1= 0. 
Thus A? is a constant multiple of the identity matrix. Since lambda matrices 
are nonsingular by definition, the constant multiplying the identity matrix must 
be non-zero. Thus At=wE*, and hence A= | E* where juis a non-zero constant. 
4 
From the above considerations we see that if there exists a non-zero constant 
f such that equation (2.37) is satisfied where A= aay Uf then equation (2.38) is 
s i 
integrable and these conditions hold for all HEH Op. 
Summarizing, we have 
MIM=17 (2.44) 


u 
and 


{K ph=u {H x} — Ie VER}. (2.45) 


1 : é oe 
Fi KE, there exists a function U(R, a) 


of class C? in its arguments (that is, .Z7 was assumed to be of class C2 in a) such 


that au eae 
LU, at = To VER}. (2.46) 


Since equation (2.40) is satisfied by A= 
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Substituting equation (2.46) into equation (2.45) and integrating gives 
K=ywH—U (2.47) 
to within an arbitrary additive function of # alone. It is evident that 
{Kab = {(K + v(x) ah 
for all {R} since {v(a) gs} =O. We may thus consider two base functions as 
equivalent if they differ by a function of # alone. Hence, we may take equation 
(2.47) as the general solution of equation (2.45). The last condition, namely 
that K be an element of $ Oo, is obviously fulfilled by inspection of equation 
(2.47), since both H and U are elements of Op. 
We have thus proved the following fundamental 
*Theorem 2.2. A map T of the collection Z is a canonical map if and only 


af there exists a nonzero constant yu such that M, the associated Jacobian matrix 
of T, satisfies the vectrix equation 


M'IM=“T, (2.48) 
im which case the new basis function, K, ts given by 
K=yH —U (2.49) 
where Le Gio 
{Ub = Te VER}. (2.50) 


It is of interest to note that this theorem for the case m—1 is identical with 
the results given by SIEGEL* and is equivalent to the results of WINTNER®, as 
will be shown in Section IV. 


Generalized Symplectic Matrices 


A matrix M which satisfies equation (2.48) in the case m=1 is termed a 
symplectic® matrix. A matrix M which satisfies equation (2.48) for ~>1 will 
be referred to as a generalized symplectic matrix of multiplier , the constant wu 
being referred to as the multiplier of the canonical map. The function U(P, Q, x), 
defined by equation (2.50), will be referred to as the remainder function. From 
the form of equations (2.50) and (2.49) it is evident that if the transformation 
equations do not involve the independent variables explicitly, the remainder 
function is, at most, a function of # alone and may thus be deleted with no 
loss of generality. Thus, in this case equation (2.49) states that the new basis 
function differs from the old basis function under the transformation T by a 
nonzero, constant multiple of 

H(p(P, Q, x), q(P, Q, a), x). 
The collection of all canonical maps may be partially partitioned into several 


natural subcollections. A canonical map is said to be uniform if the corresponding 
transformation equations do not involve the independent variables explicitly, or 


4 SIEGEL, C. L.: Vorlesungen tiber Himmelsmechanik. Berlin-G6ttingen-Heidelberg : 
Springer 1956. 

5 WINTNER, A.: The Analytical Foundations of Celestical Mechanics, equation 23, 
p- 11. Princeton University Press 1956. 

6 SIEGEL, C. L.: Vorlesungen iiber Himmelsmechanik, p. 8. 
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equivalently, the remainder function may be set equal to zero with no loss of 
generality. A canonical map is said to be basic if its multiplier is unity and 
is said to be absolutely uniform if it is both basic and uniform. 

To conclude the analysis of this section, we state the following corollary to 
Theorem 2.1. This corollary is immediate from the definition of a generalized 
symplectic matrix of multiplier yw. 

«Corollary. A map TJ of the collection 7 is a canonical map if and only if 
there exists a scalar yw such that the Jacobian matrix is a generalized symplectic 
matrix of multiplier ju. 

The definition of a canonical map given here differs from those presented 
in a majority of the literature. WHITTAKER? considers only canonical maps in 
which =1 and defines this subcollection by requiring that the differential 
form P,,dQ,—,14q, form a total differential for the case of one independent 
variable, thus defining canonical maps in terms of a first Prarr’s® expression. 
It was JacoBI® who first proved that canonical maps, as defined in the manner 
of WHITTAKER, are such as to leave the form of the Hamiltonian equations 
invariant. 

Section III. An Alternate Development 

In this section, we take up an alternate development of the conditions which 
must be placed on a map T of the collection 7 in order that it shall be canonical. 
This alternate development is included for two reasons. First, it is based on 
the Lagrangean formulation and thus will provide more direct application of 
the results to physical problems; and second, it considers the problem from the 
variational viewpoint. 

Variational Considerations 

Let « range over the finite index set W of numerosity N and let /(q,; qq, ;3 2) 
be a real function of class C? in its N(m-+1)-+-" arguments q,; q, ;, #. In addition, 
let q,(#) be of class C? in some given Y*, which assume given values on the 
boundary of Y*. We wish to establish conditions on the q,(a#) such that the 


functional 
I = J f(da(®); de, ; (#); #) dv, 
Dn 


shall have an extremal value relative to the choice of the q, (a), for the functional 
form of / fixed. 
In order to obtain the required conditions, we consider a parametric imbedding 
of the q, as follows: 
qu(1, &) , Sa ee A, 
Jal, ®),,EC%, Gu (O, H) = q, (ax) 


where the last equation is interpreted as stating that the choice of J, Which 


extremized ¥ is given by q,(a). Since q, assume given values on the boundary, 
we require in addition that 


; 9a(, ®)|9% 6 an = Ia (®)|o* o9,: 
t * WHITTAKER, E.T.: A Treatise on the Analytical Dynamics of Particles and 
Rigid Bodies, Section 126, 


® PrafF, J. F.: Abh. d. Berl. Acad. 76 (1814—1815). 
* Jacozr, K. G. J.: Compt. Rendus 61 (1837) 
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Under the above parametric imbedding, the functional .~ becomes a function 
of the parameter 77 (that is, 


1) = [1 (Galt ); da, oC, ®) ; a) dry) 


so that 4 (7) has an extremum for the value 7=0 under the condition d.4(7)/d7=0. 
Let dh(n, x) denote @/6n (h(n, a)), so that 


04 (1 YI tg 2 Ga F Ti ay, 5 8 Ia, i} 29 n- 
Dr 
Consider the vectrix Va defined by 


ef 
fae te leo Ge: (3.1) 
Now 


[Ves du, = =f Jen ds = S tea i6Ia NaS 


Dn 2ODn Dn O Dn 


by equation (1.16). However, oe vanishes on 9* © G, since q, is independent 
of 7 on 9* © G, by hypothesis. Hence 


=f Vefdv, =U es it fF ay,¢(9%.), i} dv,. 

Dn 
From the definition of the pace a 6, it is evident that 6[() ;J=[6()] ; under 
the assumed continuity conditions, and so 


jb 0 Ga, jv, Sg, Ode ety: 
Dn } 
Substituting this result into the expression for 6.4(7), we obtain 
= J {hag — Poagi),it 640 E%n 
Dn 


or equivalently PwC) vil {E| Pay © Va Fn 


where det 


ee eae Fa, i) 6 (3.2) 
is the Euler-Lagrange operator. Now, 4() will be stationary if 6.4(7)|,-9=0. 
Since 6q,|,—o is arbitrary, we obtain as a necessary condition for % to be sta- 
tionary 


(E| fo, = 9- (3.3) 
That this condition is sufficient is immediate. Note, from equations (3.1) and 
3.2), that 2 
bs Hie Ree an 


Invariance of the Euler-Lagrange Operator 
Consider now an arbitrary transformation T on q, of class Cl), 


da = da(Qs, #), 
and imbedded Q, in a parametrization by the equations 


Qe (7), x) = Qe (qx (, x), a) . 
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Now 
beef CH i Ix, 0, 06,1 
so that 
ENC ae O, Vx, v Ce ae 0;; Ta. Oe (3.4) 
a f) (3.5) 
8 Ga = Ia, 9,0 Up- 3. 
Under T we have 
ip 0B,j es Tor, One aie Lay, i Te,, i Cay 
which by equations (3.4) gives 
t 08, a Pay, j I, O° 
Hence 
t 06,j 005 ae I aas,j 0,0, 6Qz, 
which by equations (3.5) gives 
f 28,7 a) Qe = F a,j 94a = J; 
so that J is invariant under 7. Now from equation (3.2a) we have 
df —VeT ={E|f}a,94> 
which, since both 6f and i are invariant under 7, gives 
6f Ve ={E| flo, 8 Qp, 
and hence, by equation (3.5), 
{E| Phos 0 Qe car {E| Pha 9% = 1E| Phan 12, 0, 6 Qs 
so that 
{E | Pog a {E | Pyay qa, O, (3.6) 


under any T of class C?!. Thus we have the well known 


Theorem 3.1. The vanishing of the Euler-Lagrange operator 1s a necessary 
and sufficient condition for 4(n) to be extremal under all transformations T in 
its functional arguments of class Cll, 

It is evident from the definition of the Euler-Lagrange operator that it is 
a linear operator on a collection of functions of the same nature as the f/ above, 
that is, 


{E| DA fybag = 2 AE haba: (3.7) 


The Null Class of the Euler-Lagrange Operator 


We now consider to what extent the function f(q, q,;,@) is determined by 
the condition that the Euler-Lagrange Operator {E|f}, vanish. Expanding 
equation (3.6) in explicit form, we have 


El ay = lay “tidy ap 9 eae = ted gap ep, i 5 O:f ay, i (3.8) 


which may be considered as a function of the N-+-»N+4inN(n+1)-+-n variables 


da>o,i> I2,i7, *. Suppose we have two functions g(q, q,;, ©) and A(q, q ;, a) such 
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that the 2N Euler-Lagrange operators {£| g},, and {E| h},,, considered as functions 
of the N+nN=3nN(n+1)-+m variables, as above, are equal pair by pair 
(thatis, {E|g},,={E| h},,), 80 that {E| g—h},, =0. Thus, we consider the situation 


{EIR (d.3 4 


qa, i» = 


in N+nN+3nN(n+1)+ variables. Such a function k will be said to be an 
element of the null class (E) of the Euler-Lagrange operator {E |}. 

For purposes which will be evident below, we shall need to characterize 
precisely the null-class of the Euler-Lagrange operator {E| },,. 


Theorem 3.2. Let a,(x) be a collection of functions of class C? for all x in 
a given D* where the numerosity of the « index set is N. The most general element 
of the null-class of the Euler-Lagrange operator, NE), in ascending powers of a,, ;, 


if 
is given by >) O,(By; 4, ;), where 
s=0 


0; (By; Ay, i) =B 


5 ia aegenslaie le ees 
Oy 41+ Ao tOy, 14 a i. 


(1) 
(2) The B’s are of class C and depend on «x and a, but not on a, ; 
(3) 7+41=max (2, min(n, N)) 

(4) 


4) Fors =2, the B’s are completely antisymmetric in the Latin and Greek indices 
separately. 
(5) Baa, = Yi, Bees s— 0 (3.9a) 
Bravinash =e 20, boyy bie, me BS eps s=1 (3.9b) 
B,, ty...%5 ts, ges 44 a Oissa |e 11.--s+1 Is+1 
; Ae SEY (3.9c) 
Oy 21.» Hg tg, Oey ee ei dee a cealeceal ts, a, 
Toc wane tae Bay iy. : ) Nee (3.9) 


09 17, Ay, 


Proof. If A(a,; @,,;;#) is a general element of the null-class of the Euler- 
Lagrange operator, then by definition 


4p 144A, ay gap + 96,14, ay gag + % A’; ier 0 

must be an identity in a ;;, ds ;, and ag for all # in a given Q*. This identity can 
be satisfied for all a, (a) which are assumed to be of class C? only if dg ;;4 a, se 
is an identity in a, ;;. Due to the assumed continuity conditions on dg, dg ;; is 
symmetric in 7 and 7, and thus it is sufficient to require 


AM a, = 0 (3.10) 


14, mE, n Gym 4m 


in order to satisfy a, ;; A 
in a, ; for A: 


ay, ag; =O Consider the following formal power series 
ry 4 7 


ele 0.(8 2 %,;) 


where 


Ope De O, 8B . — a Drees 


O11, Ak tp 4, 11 ak , tk 
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and the B’s depend at most on a, and a. From the form of O,,(n > 1) we have 


Nae byt Oh th xe Op tpee oO Bi Ly oes X]T] 00 hh OR ov (3 4 1) 


for all k<l<n. It is immediate, upon inspection, that O, and 0, satisfy equation 
(3.10). By equation (3.11) we have 


OF ays —=15 Ds. ty %g—Lis—1 UM Beis Fs 
and 


Oa eatin = s(s = 1) lop ee Gy, i, * + Gas_9,is—2° 


Hence, to satisfy equation (3.10), we require 
| 
= > s(s + 1) Yo, iy ere id ees eS (Bo 41 ...%s—Qis—QumEn T EO ee " 
S: 


For this to be true for all a, ;, it is required that 


=0. 


l 
By, 41... s—Qis_guméEn T 15 11 ...%s—Qis—QunEm 


This result implies, by equation (3.11), that B,;.,;, must be antisymmetric 
in all Latin and Greek subscripts separately. With N the numerosity of the 
index set and 7 ranging over an m-dimensional space index, By, ;,4,:;,=0 
for all k=max[2, min(n, N)]=r+1. We thus obtain the following finite series 
as the most general form which will satisfy equation (3.10): 


A= 3'0,(Byi days) (3.12) 
s=0 


To satisfy the identical vanishing of the Euler-Lagrange derivative of A it re- 
mains to satisfy 
sw A Gu, mag + Om A A a = 0. (3.13) 


> au, ™m™ cieed 24 


Substituting equation (3.12) into equation (3.13) and using the properties of 
the B’s thus far established, we obtain 


y 
ae Ay, iy 3a Ay. 4, is—] Ge in (By, ty con Xg—] ty—J fe i ag 


ars De, Dee Geo Wee ied Om CBs h i AG Ses) i 


rae Wes, ts oes Was, ts (Bap 11 100 Xs tae =0 


as an identy to be satisfied in a, ;. This identity can be satisfied for all a, , if 
and only if the coefficients of each a, ; vanish separately. Thus, upon making 
the substitution 4=«,,,, we obtain 


S (By, i, +1 % 1 Vp AG i), ax, (Be ty 1 Xs a A s44 ate (3 14) 
+s l)e, Bag here eee eee 
1B Se aie gels Boe Sata. han 20), See (3.15) 
If «, and «,,, are interchanged in the development, we obtain 
Ss ( ~ hy 11.4. X%g_1 ts_1 Xs ‘ide = (Be ty 10. Xg_1 Ts_1 Asay als aK, ae 
“oe (s = 1) Cre KBs ty 0X1 ts_1 A544 tg Ky #8) = 
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for 0<s<yr, Adding this to equation (3.14) and using the antisymmetric pro- 
perties of the B’s with respect to the «’s, results in 


s{(Bays, vee Ag Tg] Xs eee ah: Cea eas Ig—1 Us41 is), a 
= Coe tg] M44 ie) sac is ULAR ie as Po aR OS TY, 


since for that value of s (7.e., s=1) the above equation results in an identity. 
It is thus required, for s > 1, that 


Lee BY. Bs eae ae ar (Byes: Toe he ee Real Maes = 0. (3.16) 


Combining these results, the following system of equations is obtained which 
the 5’s must satisfy in order for A to be an element of the null-class of the Euler- 
Lagrange operator: 


0;, By ty = B ay? 
Bae an, i 2 os Be 11 yt - Ba) 11, Ady? 
Be Ty eg Ig, AKs+4 = Ge Oy 11 6 Ag tg Xs 1ts41 ? 
ie (<< Se 
Boe SSCS Ig = sie) G0, Lyman By, 11 ++. Xs—1 T5-1 Hs 41 ts, Bary? 
Yr By 4s 00 My —1 tp 1 fh ly, Got, Be tye Oty ty, Ay? 


which proves the theorem. Q.E.D. 
If y=1 in Theorem 3.2, we obtain 


di, Be Ta By ao, ? Oo%2,dx,~ ~~ &1%2,Aa? 


which are necessary and sufficient conditions for the existence of a variable 
n-vector function I'(a,, x) of class C* such that 


BO) ier By eet 


» ay" 


We shall obtain a particular solution to equations (3.9) for illustrative purposes. 
Starting with B= 0; I, we obtain by direct operation 


2 
BO oF Le any ar Ci, Lig to? 


By Ae Ee 1112, Ags “1 Oi, Ras Oe t2 73? 
Ba ety Mysims VOAub eR Cy ul pT ty, AX, 
where 

iS 11 +. Ob if tae tie aR RARER) Se 0) 


Es ty 10 Xh-1 1R-1 Le i 11 «+ XR—1 th-1 th, ety, = k 20, 


is = 


rt ORR RAL, Aoepty Wep+e 


and all I”’s are of class C2 and antisymmetric in the 7’s and the «’s. Thus if 


Sars Ol =U, 


; A ato : rs )a 5 fhe 
141 ++6%s—14s—1 1s, Aa, | Ci al aria Aa SED T= SEB 


and if 


Ss ve ? O} De 14 on. by—] Vy—1 Vy, Ay Ge. i; Ay, ty 
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Substituting these results into equation (3.12) of Theorem 3.2 gives 


A= (0,5 a L, a i,) si (Opt Des iy i | et ny Ay, i, Dy.) oe eee 


11, doy A, 
ae (0;, Le Upraee pe tymty Bey it * Gays, ipa aa Us by 1 hy—1 by—1 Uy, Uy Coe Gy, iy) ; 
Now 
aS iyo Op Op R41 We, iy Dan, i), tR+1 
=e I cae repair Loiiztpipipe ena) Aan Fae 
+k Ly dvi pth tpg Con, ter" Dap, th ipl 


of which the second term on the right vanishes due to the antisymmetry of the 
I’s and the symmetry of a,, ;,;,,, in 4 and 7,,,- Hence, upon rearrangement, 
we obtain 


A= (Li, See Sena Sips ale yes ii Op tty t Coyne Bo Aa ee (3.17) 


The usual statement! made concerning the null-class of the Euler-Lagrange 
operator is that a divergence is variationally deletable. The above particular 
solution bears out this statement only under the very special conditions placed 
upon the I’s. In other words an arbitrary divergence is not necessarily varia- 
tionally deletable. The special form of the A above can be interpreted as re- 
quiring that the divergence shall not explicitly contain second derivatives. 


It is thus evident that if two functions g(q; q,;;#) and A(q;q;; a) are such 
that {E| g}, ={E| /},, then there exists a function U(q; q ;;#), which is an element 
of the null-set of the Euler-Lagrange operator, such that h=g+U. 


Correspondence between Lagrangean and Hamiltonian Systems 


Equations (3.3) are N second order partial differential equations for the 
determination of the N functions q, (a) which are such that the functional 


Sali (qye qe eee 
Dn 


has an extremal value. We now wish to decompose these equations by varia- 
tional methods into a particular set of N(mw+1) partial differential equations 
of the first order which are such that all non-linearities are of an algebraic nature 
only. We commence by defining 1 N new variables p,; by the equations 


Put lyae: (3.18) 


Under this substitution, we have from equation (3.2) 


{elf jqg dae Pa (3.19) 

It is noted that equation (3.18) yields a well defined transformation de; le 
if the condition det| f da, 6.98, j| = 0 is satisfied. 

Consider the function H defined in terms of the N+2nN-+% variables 

da) Wa, ii Pai» & by def 

HT Pap dae) as ae (3.20) 


10 Lanczos, C.: Rev. Mod. Phys. 29, No, 3, 339 (July 1957). 
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Computing the differential of H, we have 


4H = qy i 4 bai + boi @ Ie, i 
= fay? Ia — Fag, 4 4e,i 
— 6,fdx,, 
which by equation (3.18) gives 
aH = qa,it Pai = ae Nee 0,f dx, 


so that H may be considered as a function of £,,;;q,;a. (It is noted that H 
is of class C? in its arguments, since f is, as seen by equation (3.20). From the 
differential of H we have 


qa, i a Hf Pai 
Fae alas (3.21) 
— 8/=8,H, 
and hence equation (3.19) becomes 
it ea LL apy ies (3.22) 
Upon combining the above results, we conclude that 
qui= 1, ; Pie Fly, (3.23) 


which states that (f,q) form a Hamiltonian system base H, possessing the 
desired property of having only algebraic nonlinearities. 

If the matrix Cf oa,292,j) = Pai qg,;) 18 nonsingular, we may solve, at least 
locally, for qg ; as a function of £,;;q,; a by equation (3.18) and thereby obtain 
H as a function of (f, q, #) from equation (3.20). It is evident that if (7 da, 196, j) 
is nonsingular, then (1 by i pj) is nonsingular since 


(7 bx i bp;) =a (da, 4, 59, )= (hai 14g, vs 1 — = (7 da, i 9B, )e m3 (3.24) 


Consider now the function f defined by 


1 (qa: qa, ir DE: (eas x) ae DEER A (p, q, x). 
Then . 
af = yi 4 da,¢ + qx, i 4 Pai a ep Bae ee og CHa. 


If we consider p,; as a function of q,; q,,;;@ defined by 


de,i= H 


» Pai? 


which is possible if det | », ;».,| == 0, then 


af = yi 44e,i— 1, 44a — 0,H dx’, 
and hence 


Fags Pas’ (eh Ee f= — 6,f. 


Comparison of the above result with equation (3.22) shows that we recover the 
Euler-Lagrange equations. We have thus proved 
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Theorem 3.3. The Euler-Lagrange equations and the Hamiltonian equations 
ave equivalent provided the matrix (f ay jap.j) nonsingular, in which case 


(7, de 4 ap,;) = (7 bai pai) 


Canonical Maps and Variational Considerations 
Consider the function h of the N+ 2n N+ n?.N-+n variables q,; 44,45 Pxi3 Pai, j> © 
as defined by 


h= xi 4a,i— Hb, q,&); (3.25) 

then 
{E| Aha, = — Ha, — Pai,is (3.26) 
| {E| Who = Iai = 15,5: (3.27) 


Hence, for such an h, the Euler-Lagrange operator results in the corresponding 
Hamiltonian system for the corresponding f. 
Consider a transformation of the collection Z on the variables (p,q). We 
reformulate the definition of a canonical map in terms of Euler-Lagrange operators. 
A transformation T of the collection 7 is said to be canonical if and only 
if there exists for every H CH a K CH such that 


HE| hho, = — K 9g — Pai, a 
ME | h}p, ; = OF i i es 


where / is defined by equation (3.25) and w is a nonzero constant. 


(3.28) — 


The equivalence of this definition and the previous definition of a canonical 
map is seen as follows. By Theorem 3.1, the Euler-Lagrange operator is invariant 
under all T of the collection 7. Thus, by equations (3.26) and (3.27), if (f, q) 
is to form a Hamiltonian system, 

{E| hha, = 0; E| Pb rye = 0, 
then 
{E| hho, =0, {E|h}p ,=0 
so that (P, Q) form a Hamiltonian system by equations (3.28). 

We are now in possession of the results required to obtain the alternate 

characterization of canonical maps. 


Theorem 3.4. A map T of the collection FJ is a canonical map if and only 
if there exists a nonzero finite constant uw and a K for every HES OD> such that 


- (Pai qa, i i H(, q, x)) 
Fe Qn, aes K(P, Q, x) 


by at most an element of the null set of the Euler-Lagrange operators 


differs from 


{E| }p, and {E| ho, for all x in DR. 
Proof. Consider the function @(P,,; PB. gO; O, 75 2) delined: by 
Oia Bs Os K(P, Q, @). 
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Application of the Euler-Lagrange operators {E| }p , and {E| }o, results in 
1E|P}o, = — Ko, =F 


{E|o}e,; a Kip, 


Comparing these results with the defining equations of a canonical map of this 
section (that is, equation (3.28)), we have 


HE|ho, = {El Pho, 
HE |hhn = {El PhR,: 
in order for T to be canonical. This condition can be satisfied only if wh—g 


is equal to an element of the null class of the Euler-Lagrange operators {E| S00 
and {E| }p , as seen above (cf. p. 124). Q.E.D. 


Equivalence of the Two Characterizations of Canonical Maps 


We now proceed to show that this alternate characterization leads to the 
same conditions on a map T in order for it to be canonical as obtained in Section IT, 
namely that the associated Jacobian matrix of T must bea generalized symplectic 
matrix. 

From Theorem 3.4 we see that a necessary and sufficient condition for a 
map 1 3p. .==p, (2, 0, x), d,—4, (2-0; 2) to be. canonical is that 


lu (Pai Go, i = A(p, q, )) = Ps; Qs i+ K(P, Q, &) 


differs from zero by at most an element of the null-class of {£| }pg, where (A, q) 
are considered functions of the (P, Q). We have 


WA Prede 1) te, Ung K) CV (E). 


From equation (2.2), the derivatives of q, may be calculated considering q, as 
a function of P, Q, x; the result is 


qa, i ae 0; Se Se da, p, Paki = da, 9, 22,4: 
From Theorem 3.2 we have 
NE) =B+ Ba; Orit Bang Pang Bais; Qi Qs i+ 


where the index set of the a, of Theorem 3.2 is such that the a, are in a one- 
to-one correspondence with the collection of functions Q,, P,; and where the B’s 
satisfy equations (3.9). Hence, 
Mt Pai (8; qa da, p, Pari dao ace ) 8s ae P; j Qp, Ae ali 
= B+ By; Quit Ba; Heer Aeon ee 
ia Byij pri tai, jena 2B sian & PO sia een 
(x7 means that the range of the B’s in Theorem 3.2 is such as to include « and 7 
in one index set of the Greek suffices). If the above equation is to be satisfied 


for P, Q which are independent of H, the coefficients of the derivatives of P, Q 
Arch. Rational Mech. Anal., Vol. 5 9 
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must vanish separately, giving 


Pej Oi Wy — BK eB, (3.29) 
Pai Ia,9, — Pai = Baw» (3 30) 
Pai Ia, p, = Bari Gee 


and all other B’s must vanish for all values of their indices. Thus equations 
(3.9) reduce to 


a ae Bs; , 


io 


which implies that there exists an m-vector function T of class C? such that 


B= ef 


a4? Bai oa I; Ay? 
ay 
and hence, under the identification of the a, with (P, Q), we obtain 
Of; =¢ bai % Ia -MH+K, 
Tg, =U Pai de,o, — Pair (3:32) 
Tp, ae U jee qa. je 


For these equations to possess a solution, it is both necessary and sufficient that 


(2; [7) 0, = @(Li 9). (3.39) 
(2,1), yy = (Ep, ) (3.34) 
(Li 9,),08 = fe 03), 02" (3.353 
Li 09), Pak = (Zip, ),op» (3.36) 
Li, a) te Lim), Pa (3.37) 


where the /”s stand for the corresponding right sides of equations (3.32). 
Equation (3.33) gives, upon using equation (3.32), 


fae ONO 9; Pai + Mb Pai 9; da, 9, =U Pai. Q, 9:da— MH 9, + K 9,. (3.38) 
Now 
fl he fH, Paig, + Fag, Ia, 9, 
which by the use of FL pi = ei» A y= —Pai,; becomes 
H 


OM) Sar Gai Pee Dae EO. 
However 


Pon = Pai, og Qs. i ar Pai, py aia 0; Dit 


Gai = ‘Te, 5 Qs, i in de, P,, Pant 0; qa» 
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so that 
Ho, = Puig, (Ia, 04 Qs. + Be, Pas Paes = 0; qa) — 


ee Ia, 0, Pai, og Op,it Pai, wy" p,i ey Pei): 


Substituting these results into equation (3.38) results in, after cancellation of 
like terms, 


Nee Ia, 9, (Pai, og Q6.i as Pai, py, Ps, 1) — Pai, o, (a, 9, 25,1 ir da, F, Fey,i) aie Ko,= 0. 


To simplify the writing of this and subsequent equations, we introduce the 
Lagrange-bracket vectrix components defined by 


{i| u, v; pg= Pin day — Pai Ix, a 


This operator will be dicussed in detail in Section IV. 


With the use of the above-defined vectrix operator, the foregoing equation 
may be written as 


lige itt|P, pir Oar P q} + fh Qei{t| Qs, Qa; pq} ae Ko,= 0. (3.39) 


In a similar manner equation (3.34) results in 
MPa, ielh, Ah» Pai; p q} + Qs, it it 1| Qs, Ph, pq} le KE 0. (3.40) 
Proceeding in the same manner, equations (3.35), (3.36), and (3.37) result in 


Je, Pe, Pat, Ba da, p,, Pai, p= 0, 


1 
a, p,, Pai,og— Pai, p,, Ia, 06 = ra rm Ong Orn) 


dao, Pai, og Ia, Og Pai,o.= 0 

which, when expressed in terms of Lagrange-bracket-operators, give 
Pan, Faji b ay = 0, 
{i Pa Ops P.O} =~ Op Die, (3.41) 
{t|Qz, Q,;2 q} = 0 


fi|u,v;p gt =—{t 


which follows immediately from the definition of {7|u,v; pq}, we have, from 
equations (3.39) and (3.40), 


Noting the relation 


F5;,4 982 9:;-+ Ko, =9, 
=i Qs i Ok Ogat K py 0; 


These are just the equations required for the (P, Q) variables to be canonical. 
Since equations (3.41) result in the Hamiltonian equations for (P, Q) for any 
H€S © , under the assumption that (f, q) form a Hamiltonian system, we 
have proved 

9* 
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Theorem 3.5. A map T of the collection Z is canonical if and only 2f 


{t|Pi,x, Psi; Pat =0, (3.42) 
, 1 
Pres Ops b 5 =~ Ong Org» (3.43) 
{t|Q,, Q,; Pa} =0, (3.44) 
where js is a nonzero constant and 
{t|%, 0; PO} = ai y 4a, y — Paty Ia, 4° (3.45) 


We now show that equations (3.42) through (3.45) are equivalent to the statement 
that the associated Jacobian matrix of the map T is a generalized symplectic 
matrix. 


Theorem 3.6. Equations (3.42) to (3.44) amply 
M'IM= “1, (3.46) 
L 
and conversely. 


Proof. Expanding the left side of equation (3.46), using equations (2.15) 
through (2.19) and (2.3), gives 


oa eas Be ieiieates 
B’E,C+D'E,A BED) E.B 
Now 
A'E,C+C'E,A 
a (da9,) (Paro, 1 Foe + Pano, &n) ne: 
(dx, 9,) & 
= (Gite) o en Gees (3.47) 
«LO, ? ? CEO : a aie 
(da, 9,) &n 
= ({1/0., Qs; 2 a} 1 + {2|0,,0; b gh es + 
+ {3| Qu, Qa; P gh es +--+ {n]0,,Q3|P aren), 
and 
A'E,D + C'E,B 
= [({(1 Fai, Qs; PG} 1 +:--+ {n|P.1, On; Pah e,), 
C1] a2, Op; Pa} e1 ap de tlhe, Os, Prey yt (3.48) 
((1 Lynas Pay ey aa AP On p at 2.) 1 
Similarly, since B’E, C+ D' EB, A= —(A'E, D+C’'E, BY’, 
pane ({4/ Pur, Ops bahay +--+ + {n|P, TO sp. Gize.), 
BLES Ce DE Ales : ieee haa . (3.49) 


(1B. Qs: pq} os +... + {n|P,,, Oe pq} zy’ 
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Finally 


en ue & (2 |B, a1? Foi PD) é ({2|P xn? Ants Pd) 
BE,D+ D'E,B = - (3.50) 


é; ({i| Dy, Py Pa). “BaP an? Py PQ) 


The theorem thus follows directly the substitution of equations (3.42) through 
(3.45), which are necessary and sufficient conditions for T to be canonical with 
multipher mw, in equations (3.47) through (3.50) above. Q.E.D. 

A very interesting result can be obtained if we compare the results of Theo- 
rem 3.5 with the results of the consideration of the remainder function in Section II. 
From equation (3.29) we have 


Keele B — Pai Oj da 
Comparing this with equation (2.49), that is, 


K=uH —U, 
we have 


OU = tt bu; % Ia — Jaze 
We have seen, however, that 
= 041 (PO .2), 


and hence 
U—=— ibs, Os05- 0,1, (P, Q, x). (G54) 


The development given in this section is similar in form and organization 
to the development given by SIEGEL" for the case m=1. The only literature 
concerning canonical maps on Hamiltonian systems for 7 > 1 is given by LANCc- 
zos™; he requires that the form #,; q,,; change by at most a divergence. 


Alpha Systems and the Calculus of Variations 
The results obtained to this point have been for systems of equations of the 


Founly( 2:1). ihe vectrix I which has been fundamental in the development thus 
far was specifically tailored to fit systems of equations of the form (2.1). We 


shall show that a majority of the results carry over to cases where Tis replaced 
by a vectrix of a different form which is suggested by variational considerations. 

Let 8 be an index which ranges over an index set # of finite numerosity B, 
and consider the functions ¥%(a) arranged in a column matrix {W}, where ¥, 
are of class C? in a Y* under consideration, and are such that they extremize 
the integral 


Ua CU AR (3.52) 
Dn 
where # is given by 
={V} Ae V{P} — H(%, 2), (3.53) 


> 


Aisa constant Bx B vectrix, V is the vectrix operator introduced in Section LH, 
and H is of class C? in its arguments. Applying the Euler-Lagrange operator 


1 SIEGEL, C. L.: Vorlesungen tiber Himmelsmechanik, p. 1—11. 
12 Lanczos, C.: Rev. Mod. Phys. 29, No. 3, 61 (July 1957). 


132 Dominic GARDINER BOWLING EDELEN: 
to ¥ and setting it to zero, which is a necessary and sufficient condition for the 
extremization of %, (3.53) gives 

V §PY e@(A—A) + A yy =[01. (3.54) 


The form of & given by equation (3.53) contains a superfluous symmetric 


term which we now proceed to eliminate. Separating the vectrix A into symmetric 
and antisymmetric parts, we obtain 


(Py ho V (PY =EEPY (A+ A) OV LP} TAY} (A— A) 0 WEP}. 


Now 


{E{QPY (A+ 4) © V Peng = V APY 0 (A+ A — A A) = [0], 


so that the Lagrangean given by equation (3.53) may be replaced by 


L=1IYY (A— A) VP} — HY, ar) (3.55) 
since they both result in the same system of equations for the determination 
of {¥%, namely equations (3.54). 

Set 
a=A—A’; (3.56) 


then using equation (3.56), equations (3.55) and (3.54) become 
V{PY e+ Hevy= [0], (3.57) 
LLY GeV YY — HY, 2). (3.58) 


Assuming & ¢ is nonsingular, it is evident from equation (3.57) that {¥} forms 
an alpha system with structure vectrix @ (cf. Section II, equation (2.13))4. We 
thus have the following 


Theorem 3.7. Every system of equations resulting from the extremization of 
an integral of the form { Ldv, where L is given by 
Dn 


L = WY Ao VP} — HY, x), (3.59) 


A is a constant vectrix, and det|(A — A’) e(A—A’) 
with structure vectrix 


+0, forms an alpha system 


mie? ese (3.60) 
and £ may be replaced by 
‘L=1LEPY GeV LP} — HY, x) (3.61) 
with no loss of generality. 


Let us examine the Hamiltonian system from the standpoint of its being a 
special case of an alpha system. We have seen that the Lagrangean for a Hamil- 
tonian system was given by 


L = fyi Ia,i — H(p, q, x). (3.62) 
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Let ae 


zs ; (3.63) 


a 
| 

S 
> 


then 
Pai Qn,i = {RY Ae VER} 
where {Rk} is the column matrix introduced in Section II. Thus, 
L=LRY eV {P} — HR, x), (3.64) 
and equation (3.61) becomes 
VERY 00+ H ex = [0] 


= 


where = 
a=—A—A’. 


From the form of A given above, by comparison with equation (2.3), we con- 
clude that a = —TJ, and hence det |@ea@| =£0, so that 


— V{RY oT +H exy= [0], 
which upon taking the transpose results in 
To V{R} + {Hx} = {0}. 
By Theorem 3.7, we may replace #= ERY Ae V {R} —H(R,x) by 
‘G =1ERY(A— A) o VERY — HR, 2). 


For this latter form of \# we see, upon using equation (3.63), that £,; q,,; is 
variationally equivalent to 


eat dai = Pais a 
which implies that £,; q,,; 1s variationally equivalent to —#,; ;q,- 
The result is evident by Theorem 3.2 since 
ee Gan, i a (Doi de), i os Pas Gas 


but is interesting in that it is arrived at solely from structural considerations 
without any consideration similar to Theorem 3.2. 

As a further illustration, we apply the general method of alpha systems to 
the problem of Hamiltonian systems with constraints of the form 


Fie; Re, — 1, (R, £2) =0, 7k SAR 
where F:,; are constants. Rewriting the constraint equations, we have 
F,¢ V{R} — j,(R, x) =0 
where F, is a row vectrix defined by 


F,= [Fai ei Fai 38 
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Multiplying the equations of constraint by undetermined multipliers 7, and adding 
to the \¥ for the Hamiltonian system gives 

L=EPy de VIL} — H, 

H=H(R,x) +, f,(, 2), 


, {4 
J=): 7, 
F, 
te [eee 
a => | a , 
J’ O 


from which we may proceed by well established methods to obtain the system 
equations in the presence of the given constraints. 


Section IV. Properties of Canonical Maps 
Group Properties 
The first properties we shall establish are the group properties of the various 


types of canonical maps considered in Section II. These group properties are 
completely delineated by the following theorem and its corollaries: 


*Theorem 4.1. The collection of all canonical maps form a group. In particular, 
if T, and T, are any two canonical maps with associated Jacobian matrices M, and 
M,, multipliers i, and p., and remainder functions U, and U, respectively, and 
M,u, U are the associated Jacobian matrix, multiplier and remainder function 
respectively of the canonical map obtained by applying T, after T,, then the combi- 
natorial laws for M, uw, U are 


WEEE (4.1) 
Namah Wat (4.2) 
U = U, + pe Uy. (4.3) 


Proof. By Theorem 2.2, the collection of all canonical maps are those of the 
collection .7 for which there exist nonzero constants 4 such that the associated 
Jacobian matrices, M, of the transformations satisfy the vectrix equations 


MIM= : rp 
For clarity, we note 
{Ry > {RY COB {Ry}, 
{R} at, fR}, 
{RF ={R(R,x)}; {Ro} ={R,(R,,a)}, 
{R,} = {R,(R, x)} = {Ry(R, (R, a), )} 
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where the {R}’s are the N(7 + 1)-element column matrices introduced in Section II. 
It follows immediately, from the Properties of matrices, that M=M,M,. Thus 


M'IM= M, ‘MIM, M,. But MIM, = 1 and UE “ ny i since both [ 
A 


Ms 
and 7; are assumed to be canonical ; hence M’ IM=- I. We may thus conclude 


Macteg. 
that there exists a multiplier w=y, 4. such that M’IM= ' jf and hence M is 
bb 


a generalized symplectic matrix of multiplier w. Thus, the product of any two 
canonical maps is a canonical map (product interpreted in terms of the successive 
application of canonical maps). From equation (2.50), the remainder function 


is defined by {Uz }=Ie ViRy. Now Ie Viky= Ie VIR STARS, cep ° Vik 
which, oie TR 03 = M;", gives Je V {R,}=Ie V {R} + I Mzte Ye {R,\ 
= {U,, p,}+1Myz* V {R,} by definition of U,. However, Mz*’ IMz*=wI, so that 
IM3*=1,.M41, and hence 
Ie ViRss = {U,,p,} + ul, Ma Te VIR, 

as {U2 RJ + Me M3 {U1 rb, 

= {U2 pr} We He {U,, RS 
Thus, the composite remainder function is given by U=U,+ ,U, to within 
an additive function of # which may be deleted with no loss of generality. Equa- 
tions (4.1) through (4.3) are thus established. It now remains to prove that the 


collection of all canonical maps contains the inverse of every element and the 
identity element. Taking M to be the identity matrix, U=0 and w=1 result 
in {R}= {RY and K=dH by equation (2.49) and hence represents the identity 
map. From equations (4.1) through (4.3) it is evident that if M, is the inverse 
of M,, then M, M,= E* whence yp, u,=1 and 0=U,+u,U,, so that M,= M3", 


en hU,= — 2 U,, and hence the inverse of every canonical map is a canonical 


map. FOE D: 

The group of all canonical maps, established by the above theorem, is a 
generalization of what is called the symplectic group’® for the case n=1, and 
will be referred to as the generalized symplectic group. It is the generalized 
symplectic group which is the invariance group for systems of generalized 
Hamiltonian equations. The system of equations (2.1) or their vectrix equi- 
valents, equations (2.12), are invariant in form under this group for all bases 
functions. 

The following corollaries to Theorem 4.1 are immediate upon noting that all 
the subcollections of canonical maps considered contain the identity element. 


*Corollary. The collection of all uniform canonical maps forms a group, the 
uniform symplectic group. 


*Corollary. The collection of all basic canonical maps forms a group, the 
basic symplectic group. 


13 SIEGEL, C. L.: Vorlesungen iiber Himmelsmechanik, p. 9. Berlin-Géttingen- 
Heidelberg: Springer 1956. 
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*Corollary. The collection of all absolutely uniform canonical maps forms a 
evoup, the absolutely uniform symplectic group. 

Consider the subgroup of the generalized symplectic group which are linear 
and homogeneous in (f, q) (that is, for which {R}=M™ {R* where M=M(x) 
is a nonsingular matrix for all win Zt). For this subgroup we have MYNIMI=ul 


and V {R} = ( y™M) {R}. Now, by equations (2.48) 
{Ug} =e VER} = (VM ) {RY 
—[e iat MERY, 
so that the remainder function is given by the quadratic form 
U = {Rp Te (VM) MER}, 
and hence, by equation (2.49), 
K(R, ae) = H(MERY, a) — £{R} To (VM) MR}. 


As an example, consider the matrix M™ given by repeating along the principal 
diagonal an orthogonal N x N matrix v(x), that is, 


v0... 
0 v Cae 
~ VSM. 


M1= 
vo 


It is easily verified that M7 IM3=I, so that such a map is an element of the 
basic symplectic group and hence will generate a canonical map. For this case, 
the remainder function is given by 


Ul== z (LQ. (9; UO) ULP SY aa [F. iv) 'v{Qah) - 


There is a “‘factorization’’ of the generalized symplectic group by which we 
may represent any canonical map as the product of a homogeneous linear dila- 
tation and a basic map. 


*Theorem 4.2. The generalized symplectic groups admit a factorization into 
the basic group and the group of all homogeneous linear dilatations (all canonical 


maps whose associated Jacobian matrix is of the form = E* for any a bounded 
away from zero), in the sense that if M is any iyi bs matrix of multiplier 
then M=M,M, where M,= we and M,= Mj," M is basic. 

Proof. First we note that M,= ae E* is an element of the generalized symp- 
lectic group with eatin uw for M, = +7 and that the roar eli of all 
=; E* form 


a group. on Theorem 4.1, M, is an element of the generalized ce group 
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since Mand M, are elements, and M, has multiplier w.=y/u;=4/u=1 by equation 
(4.2) since M;* has multiplier 1/u. Thus, M, is an element of the basic symplectic 
group. Q.E.D. 

If we consider the N(m-+1)-element column matrix {R} as an ordered col- 
lection of N(m-+-1) numbers, then we may represent {R} as a point in a Euclidean 
space I of N(m-++1) dimensions by making a one-to-one correspondence between 
the values of the elements of {R} and the coordinates in J. Such a space J will 
be refered to as a generalized phase space. From the definition of the uniform 
symplectic group, we see that it is a group of isomorphism of J. If we fix the 
value of # in Y*, then the generalized symplectic group is also a group of iso- 
morphisms of J. Thus, the generalized symplectic group represents, in general, 
an m-parameters collection of isomorphisms of J where the parameters are the 
a whose domain is 9*. 

Let H be an element of Op , so that there exists at least one solution to 
equations (2.12), {R}= {R(a)}. We may interpret {R(«)} as an n-dimensional 
hypersurface in J. Thus, the Hamiltonian equations may be considered as 
generating n-dimensional hypersurfaces in J. When {R} is considered as a function 
x, then J will be refered to as the solution space J, (that is, in the solution 
space the {R} are assumed to be functions of # and hence are not independent). 


Lagrange Bracket Vectrices 


Many of the results of the classic theory stem from considerations of certain 
bracket expressions refered to as Lagrange and Poisson brackets. The question 
naturally arises as to the form and properties of the generalizations of the classic 
bracket expressions as determined by the properties of the generalized symplectic 
group. The forms we use for definition are similar to those used by WINTNER"™ 
although, as might be expected, we must use the structure vectrix of the system 
involved. 

*Definition. By the Lagrange bracket m-vectrix, L(u, v; R), is meant the 
vectrix collection of bilinear polar forms defined over the generalized phase 
space I by 


E(w, v; R) © {RY TER}, (4.4) 


where {R} is the ee ie column matrix defined in Section II and 
(u, v) are any two of a collection of parameters which define a hypersurface in J. 

The Lagrange bracket vectrix so defined is the generalization, for the case 
n> 1, of the bracket expressions introduced by LAGRANGE in 1808. We have 
already encountered the components of the Lagrange bracket vectrix in Section ITI, 
where they were used to facilitate the alternate development of canonical maps 
from the standpoint of the calculus of variations. 


Theorem 4.3. The Lagrange bracket vectrix exhibits (for any (u,v) satisfying 
the conditions of the above definition) the following properties: 


(a) L(u,v; R) = {i|v, 4; p Qh é; (4.5) 
14 WintNER, A.: The Analytical Foundations of Celestial Mechanics. 
15 TAGRANGE, J. L.: Mém. de 1’Inst. de France 1808. 
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where {i| u,v; p q} was defined in Theorem 3.1 to be py; , de a go day? 


*(b) L (u,v; R) =— L(v, u; R), (4.6) 
(c) L (des dg, K) = 0, (4.7) 
(d) L (Pais pp; R) =0, (4.8) 
(e) L(g, Pes) = benle (4.9) 
(f) Lag, Pj) ol (dy, Pani R) = Opy Oya Oj, (4.10) 
(g) Lu, 0; R) =[L(qy, 4; R) © L(Pp;,0; R) — 


(4.14) 


— L(qg,v; R) eL(pp;,4; R)] e;. 
*(h) Let T be any element of the generalized symplectic group of multiplier 
bb, T: {R} > {RY then 
L(u,0; R) =" L(u, 0; R), (4.12) 
and conversely tf (4.12) follows for any (u, v). 
Proof. Part (a) results directly from an expansion of equation (4.4). Part (b) 


results from noting that Tis antisymmetric (that is, 
L(v, u) ={RY, UR}, = {RY LER}, = — L(u, ») 


Srey (= =i From (a) we have 


> 


L(q,, dp» R) a {7| p> qy> p q} Gj a (Da 4,48 deg, zy Pai, ay de, gg) eg; — OU 
since and q are independent in J. Thus (c) is proved. Proceeding in a similar 
manner, (d) may be established immediately. Using (a), we have 
L (dg, Pyj3 R) = {2|Dyj. Ips BO} ei, 
im (Px Dy, da, ag —_ Pa i ag da. p,,.) Ci, 


=> > 
é. 


aie (Bai, Pry da, gg) is gy e;, 


since # and q are independent in J, and hence L(ap, p, ;; R) =03,é;. Thus (e) 
is established. (f) follows immediately from (e). Noting that p and q are inde- 
pendent in J, from (a) we have 


Liqy, 9; R) = 2p; 2, 


L(pp;, 0; R) = — dp,» 2: 
Hence 
L (dg, 4; R) ¢L(b5;, 0; R) — L(qg,v; R)  L(p,;, w; R) 
= — Pain Wey % 9S + Pei, Wp 4 2? eG 


me Ws, Pei» = V8» Pai u? 
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which by (a) is just the j" component of the Lagrange bracket vectrix L(u, oT), 
Thus (g) is established. By definition L( (u, 0; R) = ER} TERY su Nows Ry = 
MER ,» Where M is the associated Jacobian matrix of the canonical map consid- 


ered, so that L(u, OR ani M IMR} se grt EXER byeDheorem2.1. 
Since the last SUES is alu to J Llu, v; -R) ae definition, the es follows. On 
the other hand, if L(u, v; R) =pL(u, v; R), then ERY MIMER} , = = RY aie 


so that ERY x [MIM —- i: I r| ERY 7= O05 If this is to be truetor “alt u and v, then 
certainly it must hold for any (uw, v) elements of ER} and hence for ER} so that 


({8}, <a)’ [MM — I=0. Q.E.D. 


> — ; > 7 
I | ({R}, ci) = MIM — o @ 


a 
lu 
Part (h) of the last theorem allows us to calculate the Lagrange bracket 


vectrix for any RY obtained from a known {R} by a canonical map simply 
through multiplication by the reciprocal of the multiplier of the map considered. We 


may thus drop the superfluous R in writing L(u, v;R) (that is, Lu, v) L(u, 5 1) 


The Lagrange bracket vectrix possesses an additional property under the 
generalized symplectic group which is similar to that of the integral invariants 
of PoINcARE}®, 


*Definition. A transformation is said to admit an absolute integral, -%, of 
multiplier ~ and order W if and only if the integral of % over every W-dimen- 
sional compact point set in the space of functions over which the transformation 
operates changes only by multiplication by 1/w when the points of the space 
undergo the transformation considered. If w=1, 4% is said to be an absolute 
integral invariant of order W. 


*Theorem 4.4. Every element of the generalized symplectic group of multrplier 


uu admits the absolute integrals bat 
L(u, v) 
of multiplier u and order 2. In addition, the basic symplectic group admits the 
absolute integral invariants be 
L(u, v) 
of order 2. 


Proof. From Theorem 4.3 (h) we have, for any element of the generalized 
symplectic group of multiplier uw, L(u, v; R)= 7, E(t v; R) for any (u,v). If S is 


any compact two-dimensional point set in the generalized phase space J, then 
it may be characterized by a two-parameter system (w#, v). Hence 


[ Liu, v; Rydudo= [7 L(u,0; R) dudv 

S s 

for all S in J. The second statement follows immediately upon noting that the 
basic symplectic group has only unity for the multiplier of its elements. Q.E.D. 


16 PorncaRE, H.: Acta Math. 13 (1890). 
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Lagrange-S Bracket Vectrices 


There is another vectrix bracket whose form is similar to the Lagrange bracket 
vectrix, but which arises only in the case of more than one independent variable. 


*Definition. By the Lagrange-S bracket vectrix LS(x', x'; R) is meant 
the vectrix collection of bilinear polar forms defined over the solution space J, by 
LS (xi, x1; R) & ERY TERY. (4.13) 

From this definition, we have 


*Theorem 4.5. Let T be an element of the generalized symplectic group with 
associated Jacobian matrix M and multiplier yw; then 


— 


LS (xi, #1; R) = A [ES(#, ai R) + (8,£RY)’ 1, {RY — 
BY. R— (0.68) TB.) 


Proof. We have seen in Section II that {R} ;=M {RY ,— Mo; {RY under 7. 


Thus a 
TE Si( x, x): R) = iKt ie 


= ({RY;M' — 0,{RY M’) I(M§R} ,— M2,£R}), 
= (ERY; — 8, {RY) M TM(ER}, ; — 2, £23), 
= © ({RYs — G4 {RY) TERY; — 28), 
= = (RY TERS, | — (248) TER}, — 
— {RY 16, {RY + (8, {R})' 16, (RY). 


But LS(x, xt; R)= ERY, Hf L{RY. by equation (4.13), whence the result follows. 
ORE: 


*Corollary. Under a uniform canonical map, the Lagrange-S bracket vectrix 
changes only by multiplication by the reciprocal of the multiplier of the map considered. 


(4.14) 


Proof. The result follows immediately from Theorem 4.5 upon noting that 
0; {R} =0 for all elements of the uniform symplectic group, by definition. Q.E.D. 

*Theorem 4.6. Every element of the uniform symplectic group of multiplier u 
admits the absolute integrals L S(x', x!; R) of multiplier u and weight 2. In addition, 
the absolutely uniform symplectic group admits the absolute integral invariants 
LS (x*, x4; R) of weight 2. 

Proof. The proof of this theorem follows from the above corollary by the 
same reasoning as in the proof of Theorem 4.4. Q.E.D. 


Let Ben det 
LS(Qax, x : ev ine oT {RY , (4.15 a) 


def 


LS (at, @ar; R) ERY Pe ERY. (4.15) 
From equation (2.12) we have Ie VERY == fH =}, so that V{RY el = {FH ph’. 
Thus equation (4.15a) gives LS(Q~a@, x!; R) = {Hp} ER} ; if {R} is an n-dimen- 


and 
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sional solution manifold in J, which constitutes a solution to equation (2.12). Now 


H , ={H z¥ {R},+0,H=LS(Q a, x; R) + 0,H 


or 
i LS(@z, ax; R) + VH. 
We have thus proved the following 
*Theorem 4.7. Let It be any n-dimensional hypersurface in I, which consti- 
tutes a solution manifold of the generalized Hamiltonian equations (2.12); then 
VH=LS(@w,«;R)+VH (4.16) 


1s an identity over IN, where 
LS(@w, a; R) ! (VERY eI) VER}. 


*Theorem 4.8. Let IN be any n-dimensional hypersurface of I, which constitutes 
a solution manifold to equations (2.12) for x in a given Gx, where both H and the 
solution {R} are of class C?; then 


Ve LS (x, “;R)= 0H; — 0H; 


att , zB: } (4.17) 
=0,LS(Q #, xj; R) — a,LS(@ a, x; R) 


1s an identity in M for all M. itr 
Proof. From the definition of L S(x‘, x’; R), we have 
Ve LS (xi, Mee ene Ve (ERY TERY ,) 
= [V {RF s) eT) {RE + {RE (Le VERE.) 
since I is a constant vectrix. In M, Ie V {RY + {H x} = {0} so that Te [VERY] , 
See eet V6 V (ERY ;) since {R} was assumed to be of class C*. Thus 
VoL S| (x, x ae = {HF rt i Ligiis {RY {7 eh 
However 
(7) ={A wt ARE + 5 Ai, 

so that 

VeLS(xi, d:R)=H,,—0,H,—H,,+0,H;=0,H,—2,H, 


since both H and {R} were assumed to be of class C?. This result may also be 


written as 
sas 


VeLS(x', 1; R)=0,LS(@a, 7; R) — 2,LS(@a, x; R) 


LS(& &, x!) +; H, and thus 


5 
& 
mn 
| 


0,H , — 0,H;=0,LS(@a,0;R)—OLS(@e,#;R). Q.E.D. 
Integrating equation (4.17) over Y* and using equation (1.16), we obtain 


f LS(xi,;R)eNadS= [ (aL LS (a, x/; R) —8,LS(@a@, x; R)) dv,. (4.18) 


GnO Dn Dn 


142 Dominic GARDINER BOWLING EDELEN: 
If 0,H,—0;H, is zero in I, then we have from equation (4.17) 


Os fi Ve LS(x, x R) dv,, 
Dn 
so that LS (x8, x): R) is conserved under equations (2.12) if 0;H_; —¢é,H ; = 0in We. 

The converse of Theorem 4.8 is of interest in its own right. 

*Theorem 4.9. Let {R} satisfy the equation [eV {R}+ {G(R, x)}= {0} where 
{G(R, w)} is an N(n+1) element column matrix whose elements are class C1, so 
that {R} defines an n-dimensional hypersurface MM in I,. If VeLS(x', #; R)= 
(8; {G}’) ERY , — {R}; 0; {G} holds at all points in Me, then there exists a function 
H such that {Gh= {H p} (that is, the system 1s Hamiltonian). 

Proof. Ve LS (x, a Re (V {R}';) T§R} + {R}'; 10 V {R} ;. In M we have 


> => 


TeV {R}= — £G} so that 
Vo LS (x, a; R) = {GY AR}, — ERY AG} 
But {G} ;= {G} exy {R} + 0; {G}, and hence 


VoL S (xi, x3; R) =[{RY ({G} cay)’ + 8 4{GH] ERY — {RY [LG cay {RY +3) {G¥] 
= {RY [LGB cy)’ — (GB ces | ERB, + (LG) {RB — {RE 8 {G. 


Thus, the hypothesis can be satisfied only if 
(LCF, cx3)’ = {Gb cry 


which is a necessary and sufficient condition, under the assumed continuity 
conditions, that there exist a function H such that {G}= {Hp}. Q.E.D. 

In Section VIII we show that any Hamiltonian system which is absolutely 
integrable is such that its solution may be represented locally by {R}= {R (a, {C})} 
where {C} is an N(m+-1) element column matrix of constants. Using this result, 
we have the following theorem, which may be thought of as constituting a cor- 
respondence between the considerations of Lagrange bracket vectrices and 
Lagrange-S bracket vectrices. 


*Theorem 4.10. Let IN be any n-dimensional hypersurface of the generalized 
phase space I, which constitutes a solution manifold of (2.12) for {R} of class C?; 
then V @ L(£C}, {C}) = O is an identity in M. Conversely, if {R} is a solution to 
TeV {R}+ {G(R, x)}= {0} where {G¥ is of class C! and Ve L(§C}, {C}) =O at 
all points in MN, then there exists a function H of class C2 such that {Gh {H x}. 

Proof. 

Ve L({C}, $C}) =V © [RY cy)! TER}, coy] 
= [V (ERS, gcy)’] FERS coy + {RB cp) LV ({BY gc) 
Now V ({R} ecy) =(V {R}), ¢cy, since {C} and # are independent in IM so that 
Te V ({R}. ¢cy) = Te (V{R}) «y= — {H x$.¢cy. From this point on, the proof 
proceeds in exactly the same way as in Theorems 4.8 and 4.9. Ost D: 
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Poisson Bracket Vectrices and their Lack of Invariance Under the Generalized 
Symplectic Group 


Let w and v be two arbitrary functions of class C} defined over J. 


Definition*. By the Poisson bracket vectrix is meant the vectrix defined 
by the equation = a an 
: P(u,v; R) = u gry 1 {vp}. (4.19) 
This vectrix is a direct generalization of the bracket expression first introduced 
by Portsson’’ for the case »=1. Although the form introduced by equation 
(4.19) will be shown to be non-invariant, its study is included for two reasons: 
first, the form of equation (4.19) admits direct commutation relations which 
do not depend on the dimension of the space &, as seen from the antysym- 
metric character of the structure vectrix involved; and second, this form of 
bracket vectrix admits a direct extension of the Jacobi identity for the classic 
Poisson brackets. 
The properties of the Poisson bracket vectrices are given by the following 
theorem: 


Theorem 4.11. The Poisson bracket vectors admit, for any (u,v, ...) of class C1, 
the following properties : 


=> 


(a) P(u,v; R) = [1] v, u; R] e; 
where [t|u,v;R]) =p ;U aa — Yay 4 dui? 

*(b) Plu, v; R) = — Plv, u; R), 

(c) P( pj, Py R) =O; 

(a) Play, ay; R) =O, 

(c Play, yj R) = 897%), 

“(T) Pu“; hk) = 0, 
*(g) P(u +0, w; R) = Pu, w; R) + Plv,w; R), 
(h) P(u*v; R) = P(u,w; R) *v + Plv,w; R). 


(i) If u,v and w are any three functions of class C? defined over I, then 
[é| i], 0; RI, wo; R)+ [Ej], w; RB], wR] 
+ [ij] w, 4; R], 0; R] + [al lelw, 2; R],w; R]+ 
+ [j|[é|x, w; R], «; R) + [y|[é|w, w; R],v; R]=0. 
Proof. (a) follows directly from the definition by expansion. (b) follows 


from the antisymmetry of J in a manner analogous to the proof of part (b) of 
Theorem 4.3. (c) through (h) follow directly from (a) upon noting that and q 
are independent in J. (i): Let 


(2,7 


VW; U) = 4 9,5 085%, 90 M198 — 90 P85, Pag 9p — 


GO itag Ip 280 © bps TY, ay ap Y, taj epi? 


17 Poisson, S. D.: J. de l’Ecole Polytech. VIII (1809). 
Arch. Rational Mech. Anal., Vol. 5 10 
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then hat ae 
[il], 0; RJ, w; R] = (i,j|v, w; u) — (2,7 |, @; 2). 
Noting that (7,7 Nea Gi ), the result follows. Q.E.D. 
Part (i) of this theorem is JACOBI’s identity generalized to the case where 
Ne . 
The question concerning the nature of the image of the Poisson bracket 
vectrix under an element of the generalized symplectic group now arises. Under 


a map T of the collection 7 (7:{R} = £R}) we have 


U gry = 4, gy {RF cn3 = 44M 


since M1 = = {RY gry: Thus, from the definition of the Poisson bracket vectrix, 
we find that 


P(u,0; R) = 4 ¢f% M7IM+’ fy 3. 


With respect to ERY we have Plu, v; R) = gy Tho x, so that P(u,v; R) will 
change by a constant multiple y under T if and only if 


u gM IM’ {uv zy =) u ck H {vz} 


is an identity in J. The most general condition under which the above identity 
is satisfied is given by elt Ee 
MUVIM?’=yI-+8, 

where B is such that u city BURY 10) Now, if this condition is to be satisfied 
for all choices of u and v, it must also be satisfied for w and v elements of £R}. 
Thus, it is required that ERY ey B{RY ee LQ) from which we may conclude 
that re must be a zero vectrix, and hence M+ IM- iy] is a necessary 
and sufficient condition that the Poisson bracket vectrix changes by the multi- 
plicative factor y under 7. Premultiplying by M and postmultyplying by M’ 
gives MIM'="'Tasa necessary and sufficient condition that the Poisson bracket 


vectrix change by a constant multiple only under T. If T be an element of 
the generalized symplectic group, then M must satisfy, in addition to the above 


equation, the equation M'IM= Lt by definition. Hence, the Poisson bracket 


vectrix changes by a TRGB O A constant under the generalized symplectic 
group if and only if the associated Jacobian goes of the was of the 


group satisfy the simultaneous equations oe cs TE. Mim = + +e aT, It is im- 
mediate that M’ IM — = T implies MIM' = r ' Tif M is either mths or anti- 


symmetric, upon setting y=, and if M is orthogonal upon setting y= 1/w. 
Setting 
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it is evident that such an M satisfies the equation MIM=1, so that it is an 
element of the generalized symplectic group. On the other hand 


é 0 z 23 i 
MIM’ =|—2, 0 —@A+eA) +E 
—@ Ahhh 0 


and hence, in this case, M'IM=~T does not imply MIM’ ="1 so that the 
Up 
Poisson bracket vector does not admit the property of changing by a multi- 
plicative constant factor under every element of the generalized symplectic group. 
In the case m=1, M’ IM= + Timplies MIM’=_1. Specifically, iP a3 i ey 
le 


as seen from equation (2.6), for ~=1, and hence wim=17 implies M’— 

> > > > > M = 
17M+1eF{=—1 IM 0eJ=- I;'eM"I, whence MIM’= J since there 
le Me Lu lt 


is no € such that CeJ—O other than C=O in this case. 
We have thus proved 


Theorem 4.12. For n greater than one, Poisson bracket vectrices do not admit 
the property of changing by a constant multrplicative factor under all elements of 
the generalized symplectic group, although this property does result if the associated 
Jacobian matrix is either symmetric, antisymmetric, or orthogonal. 


Theorem 4.12 constitutes the first major difference between the properties 
of the classic symplectic group and the generalized symplectic group. The 
difference, namely the lack of invariance of the Poisson bracket vectrix under 
the basic symplectic group as compared with the invariance of the Poisson 
bracket for the classic symplectic group, is of considerable moment in that it 
is the Poisson bracket vectrix or its functional equivalent which is used to form 
the commutation relations for quantum field theory. 


Left and Right-Hand Bracket V ectrices 


To alleviate the situation pointed out by Theorem 4.12, we inquire as to 
whether it is possible to construct bracket vectrices which will reduce to the 
Poisson bracket for the case m=1 and which, in addition, admit the property 
of changing by a constant multiplicative factor under any element of the gene- 
ralized symplectic group. 

In the classic case (n=1), the Poisson brackets are the reciprocals of the 
Lagrange brackets. We thus investigate the problem of constructing bracket 
vectrices which are the reciprocals of the Lagrange bracket vectrices for the 
case m >1. Let fu} be an N(w+1)-element column matrix of functions of {R} such 
that the matrix V= {uh} ery is nonsingular. From the definition of L(u, v) we 


s 


have L (fu, {u}) = (ERY guy) TERY. {up det P (fut, {ups RB) = {up cry A( fu} xy)! 


where A is to be determined by the requirement 


L (uf, {uy}; R) oL (Kup, {u}) = B* (4.20) 


Oe 
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From the above forms we have P ({u}, {u}; R) af! ({u}, {u}) = VA VieV’ Iv 
—VAeI=E* in order for equation (4.20) to be satisfied. Multiplying the last 
equation from the left by V gives V(A: ol) =V, from which we may conclude 
that A must be such as to satisfy Ae I= E* in order for equation (4.20) to hold. 


We now inquire into what form Y must have in order for it to change by 
a constant multiplicative factor under an element of the generalized symplectic 


group. From the definition of P we have 
L ({up, (uh; R) = (4, cry ACY, 0)’ 
Under a canonical map of multiplier «« we obtain {uv} {R}= {uy gy M™, and hence 
L (fap, {a}; R) = {uh cy M7 AM™ (Kup 35)’. 


If we set L (ful, fur; R) = {u} cit A ({u} a) ’, then ¥ will change by a constant 
multiplicative factor 7 under the canonical map considered if and only if 


M1AM+'=74A+8, 
{uh, c® B {4h cz)’ = O 


is satisfied. If we require equations (4.21) to hold independently of the choice 


(4.21) 


of {uw} (that is, that Lu, u; R) changes by a constant multiplicative factor 
under the map considered for all {uw} satisfying the condition that {w} gry is 


nonsingular), then it must hold for {wu} = ERX, from which we conclude that 
®& must be a zero vectrix. 

The map considered was assumed to be canonical, so that M satisfies M TMesen 
Thus, I= = M'T M2 from which we obtain E* = a Ae MIM upon Bae 
that Ae f= K*. This result may be written in the more useful form M= 1 AM-1'el. 
From equation (4.21) we obtain by direct matrix operations 4am = nMA, 


which, upon substitution into the above equation, gives M= ”/ MAelI. If we 
s y= eae. ~ 
require A to be such that it also satisfies AeI=E* (that is, Yu, u; R) is 


~ 


the left inverse of L(u, wu; R) under the map considered), then we have M= /! M 
so that y=. ff 


Set 


A ({uh, {u}; R) =— L' (fu, fu; R), 
(Eu, up; R) =— {0}, cay A ((0, cry)’ 


Since Ae I= E*, it follows that Ie (—A’)=E* owing to the fact that ’—=—I 
and H*’=:E*, Thus, we see that 


L (uh, {u}) © B( fu}, {u}; R) = Be. 


Summarizing, we have the following 


so that 
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*Theorem 4.13. Let {u(R, x)} be an N(n+-1)-element column matrix of func- 
tions of class C1 such that the matrix {u} gry US nonsingular. Define the right 
and left-hand bracket vectrices by 


Rw, v; R) =—v gry A’ fw x}, (4.22) 
a L(w,v; R) =v gry A fw ph (4.23) 

where A satisfies 
Ael= iE, (4.24) 


then 


L (Gu, {eds R) o Led, 00) = 
L({uy, up) © A (fap, {0}; R) = 
In addition, the images of Lv, w; R) and Rv, w;R) are equal to Lv, w; R) 


and R (v, w; R) respectively, and equations (4.20) are invariant under a canonical 
map of multiplier u if and only tf 


(i) n= ph, 
(ii) MAM =7A (4.26) 
where A ° I- = E* and where 
L ({up, {ups R) = (ug, ch A({4h, ca) (4.27) 


We first show that the collection of A which satisfies the conditions of the 
last theorem is nonvacuous. From the properties of Ij* we have Tel E*, 


and hence by setting AS [+6 where C is any vectrix such that Ce I—O, 
we satisfy the condition stated by equations (4. 24). 

From the definition of a symplectic matrix es at | we have A* e M’IM 
=! E* by SQUSIEGD (4. 28s Hence, A*eM'I=- —M- 1 so that MA*M’eI= 
a | Os = = A* eI since A* e1—E*. Thus we suet 

(M.A* M’ — 1 A*\eI= oy 
which by Theorem 1.2 gives 
M A* M' = ‘ (A* +0), 


where C is arbitrary to within the condition Cel—0. Hence, M’IM= 


| Timplies MA*M’= = (A* + 0). Identifying A* with A and A*--€ with A, 
lu lu 


which is permissible since (A*+ C) o[=A*el= E*, we see that equations (4.24) 
and (4.26) are satisfied. We have thus proved 


*Theorem 4.14. Let M be a generalized symplectic matrix of multiplier u, and 
let A be any vectrix such that 
Ael=E*; (4.28) 
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then there exists a vectrix C, for every M such that 


Cel=0 (4.29) 
for which M’ IM= : if implies 
M1A M2’ =A (4.30) 
where bs 
A=A+0, (4.31) 


and conversely. In particular, we may take A=Tf,*. 


*Corollary. The right and left-hand bracket vectrices change only by a multi- 
plicative factor 4 under every element of the generalized symplectic group of multi- 
plier wu. 

Proof. The proof follows directly from Theorems 4.13 and 4.14 provided we 
show that equation (4.30) holds for products of elements of the generalized 
symplectic group. Let 

MIM; =" (+ @), 
1 


M,1;'M; = (;* + G,); 


then 
M, M, I;' Mi Mi, = | M,(1 + C,) M; 
My 


= Un, Sy 

My My 

EL | eS el CITY ee 
heal ae) My 


Therefore, we must show that C=M,C,M, is such that CeI=O. Now Cel= 

M, C,eM3 0, but Mj I= : IM;1, since T is canonical, and hence C= 

i M, C, eI Mz'=O since Mis nonsingular. Therefore, setting M=M, M,, we 
2 


have ae ; = es 
Mi} M's == (F006) 
My Me 
where 


C=C, + 4.M, 6, Mi. Q.E.D. 


These equations constitute the combinational laws for the C under M, M,. 
In the classic case (n=1) the equation AeI=E* implies that A=~—T since 


‘ : ae OE a: ES fo. = 
in this case [= e; ioe a (that is, 7*=J7'=—1), and there is no vectrix C 
other than the zero vectrix such that CeI—O. Thus, the right and left-hand 
bracket vectrices reduce to the Poisson bracket vectrix for m= 1, as is evident 
by using the above results in comparing equations (4.19), (4.22), and (4523)5 
We have thus obtained the desired generalization of the Poisson bracket vectrix 
for the case n> 1. 
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It is of interest to note that for the matrix 


which is canonical with multiplier unity, we have 


ik 0 ie — €,/2 
MI;* M' = ei @4,/2 @ Ay — €2Ay/2 
€& —&A/l2+ eA €2 As/2 
so that 
a 0 0) 0 
C=150 @1A,/2 €, Ag — €94,/2 }, 
0 — eA /2+ Ay €Ap/2 


for which one may easily verify that Cel—O. 


The properties of the right and left-hand bracket vectrices are as follows: 


Theorem 4.15. Let u(R, x), v(R, x) be arbitrary functions of class C1; then 


> > 


*(a) Au, v; R) =— 2" (9, wu; RK). (4.32) 
With A= i, we have 
(b) GP (u,v; R) =Ci|u,0; Bde; (4.33) 
where 
NURI SS ye 4 a? Pai ay tai Psd? 
(c) ? (Ppj» Pyxi R) =0, (4.34) 
(d) apices BO) (4.35) 
g TP hss IR) 4 OF. G7, 
(c) Celts es (4.36) 
L (Py js Ips R) = = Spy & 
*(f) P (f(y, 25%), U5 R) => fie Lug, 0; R), (4.37) 
(1 
(g) L(u,u; R) = [ee Ua, 4, by; Ci (4.38) 
(h) P (u,v; R) =[P (gg, 3 R) © L(b,;,¥; R) — ies 


—nL(qg,0; R) © L(pp;,u; R)] 


Proof. (a) From the definition 


> 


B'(a,v; R) =(— v gry A’ fu gh)’ =— U gry A {0 Rf, 


150 DoMINIC GARDINER BOWLING EDELEN: 


which gives #’(u, v; R)= Lv, u; R). (b) follows directly from equation (4.23) 
upon expansion. (c), (d), and (ce) follow directly from (b) upon noting that pe 
and g are independent in J. (f) and (g) are immediate from the definition of &% 


OED: 
Equations (4.33) admit rather interesting results. One obtains, by use of 


these equations and by noting that (f, g) are independent in J, 
2 —1 a 


> 


Lge) Hy RB eee (4.41) 


Hence, by equations (2.12), if (p,q) form a Hamiltonian system base H, then 
(p, q) must satisfy the equations 


Vou =P (aq, H; R), Bee: 
V0p,=L(0p, HR), ae 
where 
Lea, b; . Tae daa oe RS (4.44) 
Pu= %i Pai 


A Subsidiary Result 


*Theorem 4.16. Let {dR} and {AR} represent independent infinitesimal 
changes in the elements of {R}CI. A map T of the collection F is a canonical 
map of multiplier uu if and only if the vectrix equation 


Q(d, A; R) = 
is satisfied for all {R} CI, where 

G(d, A; R) = {dRY TfAR}, 

G(d, A; R) =fdRY TEAR. 


‘9a, A; R) (4.45) 


(4.46) 
Proof. Under T 

{AR} = {R}, ey AB} — M{ARY, 

{AR} = MEARY, 
and hence 

G(d, A; R) ={dRY M'IM§ARY, 
from which the result follows since 
MIM=17 
fm 

is both a necessary and a sufficient condition that T is canonical. One D: 


This theorem is analogous to the characterization of the classic symplectic 
group by means of Pfaffians, except that in the case »>41 the antisymmetric 
character is reflected in the antisymmetric character of the structure vectrix 
for the system of equations considered. 
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Section V. Partition Spaces and Partition Maps 
Canonical Maps and Partition Spaces 


In this section we consider those maps of the collection Y which are such 
that £,; depends only on Qs and P;, for fixed 2, so that Op, ;/OF,; is zero for 
RE i 

Defimtion. By the n partition spaces of the generalized phase space J, de- 
noted by #'(p, q), are meant the » 2N-dimensional subspaces of J with coordi- 
nates (q,, ~,;) for fixed 7=1,...,”. It is to be noted that the intersection A'N 7 
for all 7=-7 is just the subspace of J with coordinates (q,) and that the union 
UP over all zis the space J. 


Definition. By a canonical partition map is meant a canonical map which 
maps the partition space (p,q) of I for each i=1,..., m into the corresponding 
partition space 7'(P, Q). 

In Section II we introduced the partition of the associated Jacobian matrix 
of a canonical map by the equation 


m—(A B\. 
C D 
Thus, under a canonical partition map, the »N xn N matrix D has only diagonal 
N xWN matrices which are non-zero. It is to be noted that the collection of all 
canonical partition maps does not form a group, since the product of the asso- 


ciated Jacobian matrices of two canonical partition maps does not result in a 
partition map in general (that is, since 


ie ) 2) bs (ee So peeatn) 


C, D, C, D, v7 C,4,+ D.C, C,B,+ DD, 


the product of two canonical partition maps will be a partition map if and only 
if either C, or B, is a matrix all of whose elements are zero). 


Theorem 5.1. Let si be a smooth two-dimensional subspace of P*(p, q), and set 
I dpa, dy. (5:4) 
Denote the image of %' under a canonical partition map of multiplier u by I: then 
a ee (5.2) 

lL 


Proof. Since s, is a smooth two-dimensional surface, we may characterize 
it by a continuous two parameter family wu, v and thus 


te OGG: qa) 
IJ i 3 (u, 0) dudv 


é 


= i) ee as Pai, Ia, u) dudv 


=f fi|u,v;paq}dudy, 


2 
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a 


so that # is just the integral of the negative of the 7° component of the Lagrange 
bracket vectrix. Hence, by Theorem 4.3 (h) 


{i| u,v; Pe Vd, Dik OF, 


and so ; 
Ya | {i|u,v; PO}dudv 
Bh. 
ar 4 f eke Adie = iog* 
MJ O(u, v) lu 
Q.E.D. 


Let the numerosity of the « index set, , be N and consider the following 
N-vectors in the generalized phase space I: 


(a) the N-vector 
¢ {aa 
P= (Post 


From these N-vectors we may construct ” 2N-vectors fz} whose components 
are defined by 


(b) the 2 N-vectors 


25a = a> ot Nie as 


It is to be noted that fz} spans the partition space P* (p, q). 


Let T be a canonical partition map so that T maps z,; to Z;. (That is, {z,}= 
{2;(Z, x)}.) Set ,)= {Zz ,3 and let 


OnE 
= (ee), (5.3) 


where E is the N XN identity matrix, so that 
f= i dee Tete (5.4) 


We may state the fundamental Theorem 2.2 in terms of partition maps as follows: 


Theorem 5.2. A map T of the collection F is a canonical partition map of 
multiplier wu if and only if it is a partition map which is such that (I;;) satisfies 


the equation 


Proof. Equation on which defines canonical maps, may be written in 
the form 


L({RY, {RY) = L (ERY, (BY), 


as follows from part (h) of Theorem 4.3 since we may evaluate the Lagrange 


bracket vectrix in either the {R} coordinates or the ERY coordinates. Now, 
from equations (4.5), (5.3) and the definition of the Lagrange bracket vectrix, 
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we have 


L(u,0) = {RY TER, = eY Led 02 
so that 

L (23.423) = Ke3.cza) (3, a) 

; ==) EG) e- 
Thus, evaluating in the {Z} coordinates, we have 


E oO E O 
(AVE Od Ree I é) 
C7) Ce lu (s ie) i i) | 


upon requiring the map to be a partition map. Q.E.D. 


d 


Generating Vectors for Canonical Partition Maps 


We now establish a collection of canonical partition maps which will be of 
general use. From the proof of Theorem 3.4, it was shown that a map was 
canonical if and only if 


L- (Dai Ga, i = HZ) my; Fj Qs, ar K 
differed from zero by an element of the null set of the Euler-Lagrange operator, 


y 


Ha 304 Biase) 


Od OL, 
s=0 
characterized by the equations 


Bees be B 


» dy tht? %1,aR 


B 


Bi, ay 


and all other B’s identically zero. Since the above equations imply that there 
exists an -vector J’ such that 


it follows that 


If det |A|, where A is defined by equation (2.16), is non-zero, we may solve 
qx(P, Q, #) for Q, so as to obtain Q;=Q,(P, q, x). Set 


VERO =e Q, x) “5 (Fj Qs) ; a W, i(P, q, x) 
where Q, is written in terms of (P, q,#) by Q;= Q;(P,q, a). Thus 
[i = WE, q,#) — (£4; Qe), 
=W, 40,1 + Wi w,, ai,e + 8M — Os Foi.4 — Fai Op,7 


and hence 
lu (Pas Vea ae ff) —- Fj es > K 
=W, dai Win fait 8M — Os Fay,3 — Foi Co,i- 
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If this is to be satisfied for arbitrary (P, q) independent of H, it is required that 
K=0,W, +4, 
Pai = W, an? 
Op O15 = Wi, le 
The last of these equations implies 


W; p =0, 1 ] 
9° Bi 

x 4, Thi yi) 1, Fay 

a 7 


We have thus proved the following 


Theorem 5.3. Let (p,q) form a Hamiltonian system base H and T be a map 
of the collection Z; then T is a canonical map if there exists a vectrix function 


> 


W(P, q, x) of class C? such that 
K =0,W,+yH, (5.6) 
Mba Wi, (5.7) 
1 

= _W. 8 
Oe Wi p> (5.8) 
Wi p=, Lemay (5.9) 
1M 2,= 1%. 2, (5.10) 

a 7 


That equations (5.6) through (5.10) determine the map T is seen from the 
fact that since det |A| was assumed nonzero, we may solve equation (5.8) for 
q,(P, 0, x), which when substituted into equation (5.7) gives p,;(P, Q, x). 


Extended Point Transformations 
Theorem 5.3 forms the basis for the development of a collection of canonical 
partition maps which are the canonical extension of generalized point trans- 
formations. 
Definition. A generalized point transformation is a transformation which is 


representable by the equations of transformation Q=(Q(q, a) where det (Q, ) 
is nonzero for all x in a given J*. “ 


Theorem 5.4. To every generalized point transformation there corresponds a 
nondenumerable collection of extensions such that the resulting transformation 
equations are canonical. In particular, if Q=Q(q, x) is a given generalized point 
transformation, then 


Op = Qp(9, 4), (5.44) 
Fe, = (Qe me (U Poi t S; Py Hg) 


1s canonical, where S,; are the convponents of an arbitrary n-vector function of q, # 
which vs of class C? in q and of class C1 in x. 
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Proof. From equations (5.8) through (5.10) of Theorem 5.3, we have 


1 j A 
HOR = 2M, CHA) = BW, ge Wy m0, fae 


Bi 


Let Qs= Qs (q, x) be the given generalized point transformation; then identifying 
this Q with the Q in Theorem 5.3 and solving for W, results in 


W; = 03 (9, &) P;;— S;(q, x) 


where S; is an arbitrary vector function of class C? in q and of class C! in x. 
Substituting this expression into equation (5.7) gives 


P Pada We dias Op gi tesin Si ay? (5-43) 
and hence 


Psi = (Ong 2 (Pui + Sig)» 


which is well defined owing to the fact that det[(Q, s )]==0 since the original 


transformation was assumed to be a generalized point transformation. Q.E.D. 
Henceforth, we shall refer to equations (5.11) and (5.12) as the canonical 
extensions of generalized point transformations. 
The properties of the canonical extension of generalized point transformations, 
given by the last theorem, are summarized in the following theorems. 


Theorem 5.5. The canonical extensions of generalized point transformations 
are canonical partition maps. 

Proof. This result follows immediately from the definition of canonical 
partition maps and equation (5.13). 


Theorem 5.6. The collection of all canonical extensions of all generalized point 
transformations forms a group. 

Proof. From equation (5.11) it is seen that the matrix B defined by equation 
(2.17) is a zero matrix. Hence, the product of the associated Jacobian matrices 
of the canonical extensions of two generalized point transformations is a Jacobian 
matrix of a canonical extension of a generalized point transformation. Since 
this was the only result which was lacking in order that the collection of all 
canonical partition maps form a group, the results follows. 


Corollary. The collection of all canonical partition maps possesses a sub- 
collection which forms a group, namely the canonical extensions of generalized point 
transformations. 

The canonical map used by Lanczos?® to examine the electric field effects 
in his unified field theory is in actuality a canonical partition map as is evident 
by inspection of equation (6.8) of his cited paper. This is one of the most vivid 
examples of the use of generalized canonical maps to yield fundamental infor- 
mation about the structure and implications of a system of partial differential 
equations of the generalized Hamiltonian form. It is one of the few, if not the 
only example of generalized canonical maps in the present literature. 


18 T.anczos, C.: Rev. Mod. Phys. 29, No: 3, 341—342 (July 1957). 
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Section VI. Infinitesimal Canonical Maps 
Infinitesimal Maps 

Consider those elements of the collection 7 which are such that a constant 
multiple of the image of {R} differs from {R} by a quantity {AR} which is an 
N(n-+1)-element column matrix whose elements are of infinitesimal order ¢; 
that is, Ny 

ALR} = {RS + {ARS (6.1) 
where A is a constant. 

Definition. A map T of the collection 7 will be said to be an infinitesimal 
map with dilatation 4 if the associated transformation equations satisfy the 
equation (6.1) for {AR} of infinitesimal order. 

Definition. A map of the collection 7 will be said to be an infinitesimal map 
without dilatation, or more simply an infinitesimal map, if A=1. Computing 
the matrix M™ from equation (6.1), we have 


ARS, cy = E* + {AR}, gy 
so that 
TY eta + (H*+ em), em={AR} gy, (6.2) 
which implies 
M = 1(E* — em)+x (6.3) 
to within a matrix whose elements are of order «?, denoted by x. 
Let us now restrict our attention to those infinitesimal maps with dilatation 
A which are canonical (7.e., M is symplectic with multiplier ~). Using the results 


of Theorem 4.2, we may write M= MM*, which from the definition of M, 
namely Mee E*, gives M= 7 M*, Comparing this with equation (6.3) yields 
M* = E* — em, 


a ae 


from which we may conclude that a canonical map which is infinitesimal with 
dilatation A is such that its multiplier is the reciprocal square of the dilatation. 
We also know from Theorem 4.2 that M* is a basic symplectic matrix, and hence 
M™* is the associated Jacobian matrix of an infinitesimal map since from the 
above considerations “4=1 which implied that A=1. 

We consider, from this point on, only infinitesimal maps since infinitesimal 
maps with dilatation can be constructed from infinitesimal maps by an appro- 
priate M. 

If an infinitesimal map is to be canonical, it is both necessary and sufficient, 
by Theorem 2.2, to require the associated Jacobian matrix of the map to satisfy 


MIM=T, 


since infinitesimal canonical maps are elements of the basic symplectic group. 
Substituting from equation (6.3), where A is unity, gives 


MIM=I— e(Im-+ mT) + x, 
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and hence 

= e(Im+m'I) +*= O 
so that, to the first order in ¢, it is both necessary and sufficient for the matrix 
m to satisfy 


> 


Im +m I— O 
in order for the infinitesimal map to be canonical. Noting that m’ I= — (Im)’, 


the following theorem results: 


*Theorem 6.1. An infinitesimal map of the collection F is an infinitesimal 


canonical map (of the basic symplectic group) if and only if the vectrix Im is 
symmetric. 


Analytic Characterization and Generating Vectrices 


Theorem 6.1 provides the basis for the explicit analytic characterization of 
the collection of infinitesimal canonical maps of the unitary symplectic group. 
Differentiating equation (6.1) with respect to {RK} gives 


{Rh ery = E* + {AR} pp = M4, 
which when compared with equation (6.2) results in 


Em = {AR} gry- 


Let 

fAR¥ ¢ SOR}, (6.4) 
and hence 

m={OR} ery. (6.5) 
Now 


(Px ig) & (OPai,p, Fs (Pain dei 
Five (— 044.4.) 1 ~ (4, 9,,) 4 . — 04, »,) 4 


= (oq, yp en = (6 da, p,,) Cy ae o( da. p,,.) Cai 


as is seen by direct expansion of Im using equation (6.5), and 


(Pai 4) e; Pes (04, 4) e ep eal (04. 4, ‘ en 
mT = (Paty) CO (oq, Pas Ciseee-t a ae (044 »,,) Cy 
OP ye ej ak (6 Do. ape ey OS: aa (oq, a en 


li Im=m 1’, so as to satisfy Theorem 6.1, it is required that each cepa penten. 
of this vectrix equation be satisfied (that is, the matrices which multiply e, 


as a result of the equation Im—m'T'= O, must be equal). Thus the matrix 
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which multiplies e, gives for k=1,...,” 
OP aka, ne Oban a (6.6) 
Oda p,, = 998»? (6.7) 
a 2a, = BoP o,,) (6.8) 
Obah,»,, = 9 yap (6.9) 
Ode, 54, =0 (6.10) 
Since the above equations must be satisfied for all k=1,...,, equation (6.10) 


implies that dq, does not depend on #,; for all 6 and 7. Hence, equation (6.7) 
is identically satisfied by the conditions imposed by equation (6.10). Thus, 
since 6q, can depend only on q, and x at most, equation (6.8) implies that df,, 
must be linear in the p,;. However, equation (6.9) implies that 6f,; can depend 
at most on #,;, q,, and « for fixed 7. Hence, equations (6.6) through (6.10) 


pace dda = 844(4p.2), (6.41) 
O Pai = Nay (Ig, ©) byi + Sai(dg, £), (6.42) 

Praia, = OPbig (6.13) 

(6.14) 


= 9 dag BOP BR, age 


where h,,, and g,; are arbitrary collections of functions of class C1. Substituting 
equation (6.12) into equation (6.14) gives 


— 09x 9, = Mul,» &), (6.15) 
which are completely integrable if and only if 
Nea, oe hy Gs: (6.16) 


Substituting equation (6.12) into equation (6.13) results in 
Nay ay Pa; Ate Sai, a, = Ney a, he a> SBig, 
which, upon using the integrability conditions for equation (6.15) (that is, equation 
(6.16)), will be satisfied if 
Sai,q, S814," (6.17) 


Equations (6.16) and (6.17), which are necessary conditions for general solutions 
of equations (6.13) and (6.14) to exist, are satisfied by 
Nay = ty (G2), (6.18) 


=1,4,(42), (6.19) 


where f, and I i,q, are arbitrary functions of class C? in q and of class C! in ax. 


‘OF 
Say 


Substituting equation (6.18) into equation (6.15) gives 


Th Oda. a, — ha ay? 
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which upon integration results in 
Oda ee cls x) £ 


Theorem 6.2. For the cases n >1 an infinitesimal map of the collection ZT is 
an infinitesimal canonical map if and only if 


Aq. = — &f,(4, £), (6.20) 

A Pai = ty 4, (4%) bys + Lig (4 2) (6.21) 

where f,(q, x) and I;(q, x) are arbitrary functions of class C2 in q and of class 
Cin x, 


Proof. For the case  >1 equations (6.20) and (6.21) follow directly from 
equations (6.4), (6.12), (6.18), and (6.19), which constitute a general solution 


> 


to the symmetry condition of the vectrix Im as given in Theorem 6.1. That the 
above equations do not apply in the case n= 1 is seen from the fact that equations 
(6.9) and (6.10) are no longer required since k can have only the value unity, 
and equations (6.6), (6.7), and (6.8) become 


Patty. OP Bo 
OO ange 


es sh Ee OP a1», 
which are satisfied by 
Ob, hee We 


0d, =W. 


Pol? 


where W is an arbitrary function of (f,;, q,, #) of class C?; the present results 
thus reduce to those of the classic literature for the casen=1. Q.E.D. 


Let = 2s 
W=é, (7, (q,#) p,; +1; (4, 2)) ; (6.22) 
then equations (6.20) and (6.21) are equivalent to 
{AR} = eT '9 {Wx}, (6.23) 


as is seen by direct expansion. Thus equation (6.1) becomes 

{RY = {RY + eT 0 {W 2}. (6.24) 
The vectrix function W will be referred to as the generating vectrix of the in- 
finitesimal canonical map given by equation (6.1). 

It now remains to determine the remainder function associated with the 
infinitesimal canonical map specified by equation (6.24). By equation (2.22) 
the remainder function U satisfied the equation 

To {R} = {Ui}. 
From equation (6.1) 
VER} = VER} + V{AR} = {AR} = eV (I # {We}), 


Arch. Rational Mech. Anal., Vol. 5 14 


160 Dominic GARDINER BOWLING EDELEN: 


and since 


{Uj} = M {U} = (Bt — em) (Ua), 
it is thus required that i 
{U rb = &| TeV) (176 {7 x), 
to within a quantity of order ¢? (i.e.,0(U)=e), 
{Un} =e Vel 19 {We}) 


which shows, from the form of W, that we may take {C}=0 in Theorem 1.2 with 
no loss of generality so as to obtain 


{Up} =e Vel y= ef(VoW) x}, 
and hence r 
Di had (6.25) 


Thus, by equation (2.47), we have 
K=H—eVeW. (6.26) 


From inspection of equations (6.20) and (6.21) we have 


Theorem 6.3. The most general infinitesimal canonical map, for the casen > 1, 
is a canonical partition map which is the extension of an infinitesimal generalized 
point transformation. 


Hamilton-Jacobi Theory for the Case n=1 


The true nature of the generalization over the classic theory (7=1) is pointed 
up most graphically by contrasting a particular result obtained in the case 
m=1 with the lack of any counterpart when »>1. For n=1, from equations 
(6.6) and (6.10) we have 

Abai ak ta eWas 


6.27 
Aq,=eW»,., ee 
and the corresponding Hamiltonian equations 
bs toate: Hh qa? ad 
; j==—. 6.28 
a Lntive hay bbs Be (6.28) 


If H is of class C?, we may identify eW with HAx,, since in the case n=1 there 
is no restriction on the form of W other than that it be of class C2 in (Dp, q). 
Thus, equations (6.27) become 

A Par =e Vee, Am 


6.29 
AG. = a 4%, ( ) 
which, by equations (6.28), become 
AP — bur Ax, 
(6.30) 


Aq. = 4.4%, 
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and hence 
a= = Part bu An, 


(6.31) 
G.=42+4,4%.- 


If (f,,,9,) are identified with initial conditions, equations (6.31) state that the 
solution to equations (6.28), for sufficiently small Ax, , is given by an infinitesimal 
canonical map with generating function HAz,, since equations (6.31) are just 
the first two terms in the power series expansion of the solution in a neighbor- 
hood of the initial conditions. Thus, in the case n=1, the solution to equations 
(6.28) is the result of successive applications of infinitesimal canonical maps. 
This fact coupled with the group property of canonical maps states that the 
solution of any problem described by equations (6.28) is the unfolding of a 
canonical map from the initial conditions to the final state, at some time a say. 
In other words, there exists a canonical map which maps the initial conditions 
into the solution for a fixed value of the independent variable x, in the above 
Case Zy. It is this result which forms the basis for the integration of equations 


(6.28) in the case n=1. 


On the Existence of a Generalization of Hamilton-Jacobi Theory for Case n>‘ 


Let us examine the case where m > 1 to see whether there exist bases functions 
such that the same result follows under an infinitesimal map as shown above 
for the case n=1. From equations (6.22) and (6.23) we have 


Aq,= a ey n= us" i) Who, 
3 j 
for all i and 7 equal 1 to 7. In order that 4q, will be equal to dq,, it is required 
that = 
—e QW, = 921d", dda’ =e. (6.32) 


From equation (2.1), since (p,q) are assumed to form a Hamiltonian system 
with base H, we have 


so that we require 
— 2M, y= Hoge t®- (6.33) 


Since W;, must satisfy the relation 
le © 3 


x Mpa — 1M i, bys? (6.34) 


we obtain, upon integrating equation (6.33) subject to the conditions (6.34), 
the most general form which H can have in order for there to exist a solution 
to equation (6.33) under the conditions (6.34), namely 


H=— y,;(4,#) Pai t+ 4 (9,2). (6.35) 
In this case 


é W;= War bai dX + £5; (4, ae) dx}. (6.36) 
441* 
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From equation (6.7) we have eee 
qe 


which, by the use of equation (6.36) becomes 
k 
A bp; = Yar, q, Pai + Sik, q,) 4% (6.37) 
We have ; 
app; az Ppp a , 
so that in order that dp,; be equal to Af,, we require 


k 
Dep pan as (Par, 4, Pag + Fit, g,) dx 


which for arbitrary dx* gives 
Ppi,k= Var, q, Pai + S58, gy” (6.38) 


If we set 7 equal to k and sum over the common index and require that the 
arbitrary functions &;, satisfy 
C4 (6.39) 
we obtain, using equation (6.34), 
Pei, awe es FL ap 


which is just equation (2.1),;. Thus, equation (6.38) is consistent with the Hamil- 
tonian equations which (p,q) satisfy if equation (6.39) is satisfied. Hence, 
we have 


dq, =Aq,=— type, (6.40) 

pg; = A dp; = (Par Pag + §jn),0p2* > (6.41) 
oo panels (6.42) 

H = — ,4(q, 2) bes +4 (4,2). (6.43) 


Thus far we have satisfied the condition dq,=4q,, dp3;=Ap,; in a formal 
manner only. It remains to show that the assumed differentials are actually 
differentials. From equation (6.40) we have 


aq, =e ade tind Par dx, (6.44) 
which yields q, ,= —,,- If dq, is to be a differential (exact), it is required that 


Wak, 7 = Wajk: (6.45) 
Since 


Pack, = Park, W871 2% Pak = — Poiag Vert Sp Pai 
by equation (6.45), we require y,,, to be such that 
Be SS Pori,q, PBR + On Proj: (6.46) 
Similarly, from equation (6.41) we have 


15; = bp; 04% = (War Pag + En), qg 4%" (6.47) 
which yields 


Ppj,k = (Pak Pai + Sin), 08° (6.48) 
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Hence, if dp,; is to be an exact differential, it is required that 


(Pot, g Pas + 85h, gg)} 7 (Yat gy Pai a C71, ag) 
But 


(Pak, gg Pai + S48, q9),4= mes Yak,ag a, Pai Pri 
7 Sit agq, Prt 

T Yo kag (Yy7 dy, Pay ois ost ai + 
ao (Par, gy Pas + Fi, gg) 


by equations (6.44) and (6.48). Thus, y and € must be such that equations (6.46) 
and the following are satisfied: 


= Pai Pak, a, Pr a Pat,q, Prk = Wak a or Pot),ag+ 


‘ (6.49) 
+ [Yar Sila, ae Put Syh,q, + 9,854 — % Fit), ag- 
By equation (6.46) we have 
Pak, q., Prt we Pol, g,, Py k 7 Oy Wak = On Put 
so that equation (6.49) becomes 
= 22 aj (Vor rt Ox Pat), ag 
(6.50) 


as [Pur Si1,q. — Pat Sih ay a 0, on “= Op fst], ag 


which is the second required condition in order for the Hamiltonian system 
given by equation (6.43) to be completely integrable. 

Let A be a parameter defining a curve lying in the boundary of Y* connecting 
the points e=a and «=b; a,bc GX OG,. We then have 


d xk 
qa(b) = q,(@) —~ Pak th 


ad xk 


Pu, (B) = pa; (@) + (Ven Pag + Fj8),04 qq 44> 
he dqaldA=— Yarlea (6.51) 
dp,j|daA= (Yen Pawar Sivas eee h’, (6.52) 
dx*'da = hi, (6.53) 


and hence any initial conditions for which there exists a solution to the 
Hamiltonian equations with bases function given by equation (6.34) must satisfy 
equations (6.51), (6.52), and (6.53), since points arbitrarily close together on 
the boundary can have their respective values of {R} connected by a canonical 
map. We have thus proved the following 


Theorem 6.4. There exists an infinitesimal canonical map which generates 
a local solution to a generalized Hamiltonian system if and only if the basis function 


of the system ts given by a ces, as) (6.54) 
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where 
— Yur,q, Pbi + Par = — Pai, VaR + Oe Pai» 
O= 2f.; (AVPar — On Patl,ag + (6.55) 
= ae ate Pat Sik, q+ OE; k a 45; 1], ag» 


in which case the generating vector 1s given by 


&Wi= Wan Paid + &;,4%, (6.56) 
Saux dae Ep am (6.57) 
the increments in (p, q) are given by 
dy = Ada == Yard ™ (6.58) 
dbaj = Abaj = (Yer Paj + En), 099% (6.59) 
In addition, pg; satisfies the additional equation 
Ppj,k = Yar, dg Pap Sika, (6.60) 


identically, and the boundary conditions must satisfy 
dq,|dA =. Valk ie 
dp, |da= (Pen Pay + Sik), ay a (6.61) 
dx da= he 


for all A, where 2 is a geodesic arc parameter in the boundary of Q*, if solutions 
are to exist in the large. 


If the above conditions are satisfied, then from the group property of canonical 
maps we may obtain the solution to the Hamiltonian system at all points of Z*. 
It should be noted that the bases function given by equation (6.54) is such that 
there does not necessarily exist a corresponding Lagrangean system since the 
second derivative of H with respect to any p,; is zero (cf. Theorem 3.3). 


Transformation Groups and Divergence Equations 
Let {Kk} be: such that'd (ido, = 0, tor (Rs R ;;#) given, and consider a 


transformation on {R} nel by 
{R} = {R} + eff(R)} ++, (6.62) 
V {RY = VERY + © VERY} +», (6.63) 


where ¢ =O is a parameter independent of {R} and «. From the form of equations 
(6.62) we see that e=0 gives the identity map, while 


{7} = O£R}/Ge|,<9- (6.64) 
Starting with the given ¥(R; R ,; x), we define a new L(R; R ;;#) by 
L(R; Rj") =L(R;R,; 2). (6.65) 
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It should be noted that the function L, as defined by equation (6.65), is not 
the function #(R; R ;; a). In fact, it is not necessarily true that L is close to 


LR; Re ax) even when ¢ is very small. We thus consider the problem of what 
conditions must be placed on the transformation equations in order that L 


shall be close to L(R; ie a), that is, 


~ 


L(R; R ;; 2) —L(R; R ;; 2) =O(). (6.66) 
Expanding LR; ie x), we have 


LR, VR; @) =2(R;VR; 2) + e[L ef + Liye VER +* 


Now 


Ve (Lo ery {f) =V* Leeny) + Loy VR, 
so that 


LR OR; a) = 2(R; PR; 2) + 0(Po(Leey fh) + Elie B+ * (6.67) 
since {E | P} en3 =F gry — VoL een by definition. Hence, equation (6.66) becomes 
L(R; VR; )=L(R; VR; 2) + 6(Ve (Leen fh) +{El Dea {hl+*. (6.68) 


Since {R} was assumed such that 6 f #dv,—=0, we have {E|L}zx3= [0]; hence 
a necessary condition that equation (6.66) hold is that {/} be such that 


V9 (Lee {B)=0, (6.69) 
From equation (6.67), we also have that 
LR; VR: 5) ed OE VR; 2) +x 
(that is, Y is invariant to within terms of order ¢?) if equation (6.69) is satisfied 


and conversely. We have thus proved the following well known 


Theorem 6.5. Jf #(R; VR: a) is invariant under the transformations (6.62), 
(6.63) to within terms of order &, then Vo (LF ery {f}) =0 (that is, the system of 
equations admits a conservation law), 

Teta 
and conversely. 


Let us apply the above results to the Lagrangean (3.28) for alpha systems. 
We have 


PL =1ERY Ge VER} — H(R, a2), 
(E|L}eny = VERY 0a + H cry, 
so that Lf gpy= 3 {KR} a, and hence equation (6.69) reads 


Ve ({RY¥ &£f}) =0. 


V{RY 06 ff} + {RY de VEN} =0, 


(6.70) 


Expanding yields 
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which upon using the second of equations (6.70) gives 


— gry {ft + {RY 0 V Uff = 0. 
Theorem 6.6. The Lagrangean for alpha systems is invariant under the trans- 
formation (6.62, 63) to within terms of order «, if and only if {f} is such that 
ER} &0 Vif} = H gry {fF 


= re (6.71) 
V {RY 8+ H gy =0 


ave satisfied, in which case Ve ({R} a {7}) =0, and conversely. 

We have shown in Theorem 3.1 that the Euler-Lagrange equations are in- 
variant under all transformations T of class Cl. Thus, from the second of 
equations (6.70), we have under 7 


VERY + eff ++] a+ H gay = {0} 
but 
A ety = A egeyt {A eh ees {ih +*. 


so that, upon taking the transpose, 
ae V{R}+ che V Ef} — {Hn} — eH a}, cy + * = {0}. 
Noting that « VERY — {H x} = {0} since {R} is such that 6 f Ydv,=0, we obtain 


co PU} — {Hs}, cay {P+ * = (0), (6.72) 
which are the variational equations associated with an alpha system. Sub- 
stituting this result into equation (6.30) gives 

[LAS {7 eh cxy — A gry] {ff = 0. 


Hence, we may conclude that equation (6.69) holds when £7} is any solution 
to the variational equations (equations (6.72)) if and only if HT gry is a homo- 
geneous function of degree unity in {R}. 


We have, on comparing equations (6.23) and (6.62, 63), that Y is invariant 
under (6.23) if and only if 


0= Ve {RY T(E {W 2})) 


= VARY Wh + ERY (OH adc VERY + (a3) 
by equation (6.25). 


(6.73) 


Functional Changes Under Infinitesimal Canonical Maps 

Let f(p, q) be denoted by /(R), and similarly let #(P, Q) be denoted by {(R). 
The question arises as to the increment in /(R) when its arguments are subject 
to an infinitesimal canonical map. The quantities {R} and {RY are connected 


by equation (6.24) so that {(R)=f(R+AR). Expanding in a Taylor series 
about R gives 


~ 


HR) = F(R) +f gry {AR} + x. 
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Ignoring terms of order ¢* and substituting from equation (6.24) results in 


#(R) = 1(R) +f gy er {Ws} 


={(R) + L(eW, fe 
and thus 


{(R) —f(R) =e L(eW,f). (6.74) 
Similarly if /(R) is vectrix function of (Pf, q), 


i(R) =(R) +f ¢ny {AR}, 


= F(R) +f gy (i 3) € 
Sle ee Le Wf ¢ 
and thus 
/(R) —f(R) =e P(0W, f) (6.75) 
where 


L(eA, B)=B yey (I 1fA 53). 


Hence, the left-hand bracket vector is said to be the symbol for the collection 
of all infinitesimal canonical maps which arise from equations (6.20) and (6.21). 


Section VII. Conservation Laws and Space Structure 


The results of this section are considered of such importance that, although 
they logically could have been included in previous sections, they have been 
assigned a separate section. 


The Basic Conservation Equations 
Theorem 7.1. Let {y} form an alpha system base HES OQ, with structure 


= 
vectrix &, for all x in a given G*, and define the bivectrix W, with components W,,, by 


a7? 


Wis = 3 {yy a Ly}; — 05; (2 {yh op {yh — 2); (7.1) 
then fp} is such that it satisfies identically the n equations 
at all points x in DF. 
Proof. From equation (7.1), we obtain by direct operation 


W,,. Les Se a; {yh j —${y3 a {it Ze lees t+ 0,1. 


By hypothesis {y} forms an alpha system base H€ © with structure vectnix 
« at all points in Y*, and hence {y} satisfies the equations 


Vipy eG + H gy = {y¥ 0; + He; = [0], 


Wij c= 3 {Yh i % {vj — 3 kyh keh et Gf — {yh ies {yt;. 


so that 
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yoy 1 yy’, oe; Lp}, — EE py 1 Lyd. = BAYH (i — 1) Eph 


but a;——«a;, since @ is a structure vectrix, and hence 


BL YY OLY} | — ELVES, = Lyd LYE, 
Substituting this result into the last equation above for W;; ; gives 


W,,;=0;H.  Q.E.D. 


47,4 


The divergence type equations (7.2) constitute the basic conservation laws for 
alpha systems. Considerations similar to those above have been used by many 
authors for a variety of purposes. To note an instance of particular importance, 
EINSTEIN !9 developed the momentum-energy pseudo-tensor for the gravitational 
field and the matter field by similar considerations. 

It is to be noted that equations (7.2) result without any assumptions as to 
the differential geometry of the space of independent variables. In particular 
we have not even assumed that the space of independent variables is metrizable 
except in a local sense but only that it satisfies the postulates of a Hausdorff 
space as stated in Section I. 


The Principle of Extremal Invariance 


We have shown in Theorem 3.7 that alpha systems arise from the extremization 
of integrals of the form 


I=f Ldd,, (7.3) 
where * 
L=% {y} ae V fy} — Ay, 2); (7.4) 


the variation of {y} vanishes on the boundary of Y*, and no variation of the 
independent variables is considered. We now consider more general variation 
processes in which the variation of {y} does not necessarily vanish on the boundary 
and the independent variables are also allowed to vary. 


Set 
{P (x, €)} = {p(@)} + e{y(x)}, (7.5) 
x= e+ edx(x), (7.6) 
{P(#) 5 = {y (w)} + € 0 {p(a)}, (7.7) 
which are related by 
d{p(@)} = {n@)} + V fy (@)} @ dae (a) (7.8) 


to within terms of order ¢?. Computing the variation of % resulting from equa- 
tions (7.5) and (7.6) gives 


OF = L (Lent + Zee 9 Vim) do, + [ £oeQnas, 
DnODn 


a2 ane A.: Annalen d. Physik 49 (1916). — Sitzungsber. d. PreuB. Akad. d. 
Wiss. 1916. 
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where equation (1.16) has been used. Since 


Lig Vit =V (Lire {ns) —(V° Zireuy) {n}, 
we have the equivalent form, upon applying equations (1.14) and (1.15), 
Sd J (F g43 — VoL Feex) {ngde,+ J (Lba+ Loeox{m)@NdS. (7.9) 
Zne OGn 
Solving equation (7.8) for {£7} and substituting into the second integral on the 
right for equation (7.9) yields 
6 = J E\ LZ Seo {0 Wn + 

+ S {f ba + £74(6 fy} — Viv eZ 1% da)}QN 7dS 

G2 Zu 
upon noting that {E|2}.,,;—f .,3— Ve LF 4,3- Expanding this form in terms 
of components then yields 


OF = [1E| Lhe kh do, + 


(7.10) 
TT ie oe 655 — fL 53 AVS») ) dx “10 Lg, O{p}} NaS. 
Set 
(23 = Lig; (7-11) 
and 
T,, = £46;,—{Z} {vy} ;: (7.12) 
then 
F=SE|P heey 1 dn + f {T;, 6x7 + {Z¥ bf ph} N,dS. (7.13) 
Zi GiO7Dn 
Making the additional substitutions 
T,,N,dS=dY,, {Z)}'NdS=d{Py}, (7.44) 
we obtain the following form of the variation of J: 
oo E\ Zhen Fen + bea bx" + dt{P¥' d{y}). (7.15) 


Equation (7.4 5) states that 
(¥,,%*) and ({7}, {y}) 


may be considered as conjugate quantities on G*OG, in the same sense that 
momentum and coordinates are conjugate variables in the Hamiltonian mechanics 
of particles (n=1)™” 

There are several significant conclusions which may be drawn from equation 
(7.45). Under the conditions 


{E|\Z}3=[0], S{y$latoz,={O8- (7.16) 


2#® Weiss, P.: Proc. Roy. Soc. 169, 102—199 (1938). 
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(.e., {p} forms an alpha system and the total variation of {y} vanishes on the 
boundary of G*), the vanishing of 6% implies 


o= f dV,éx°= f TF, Nbr aS (7.17) 


— 1 
ZrO Dn QrnO Dn 


upon making use of equation (7.14). Transforming equation (7.17) to an integral 
over Y* by use of equation (1.16) gives 
O= f (Ty, 6%) ¢dy =f (Tin, 0 + Ty 6%) dey. 
Dn Tr 


The last equation can be satisfied for general G* only if 


Li Og a Toe, (7.18) 


1 


holds at all points of G. 


Theorem 7.2. We have, for alpha systems with structure vectrix @, 


Win = — Tr, (7.19) 
{ZY = {yy G, (7.20) 
d{PY =i {pV aeNdS, (7.21) 
dV; == W. NaS (7.22) 


Proof. The results are immediate upon substituting equation (7.4), which 
defines the Lagrangean for alpha systems, into equations (7.11), (7.12), and 
(7.14) upon using equation (7.1). Q.E.D. 

Theorem 7.2, in conjunction with equation (7.18), admits an important and 
far-reaching conclusion. Substituting equation (7.2) and (7.19) into equation 
(7.18) gives 

W,, 6x*, = — (0, H) bx". 
We have thus proved 


Theorem 7.3. Let {y} form an alpha system with structure vectrix & and base 
function H, and set 


SL = i aus 
Di 
where 


L = {yy de V{y} — Hy, 2); 
then the variation of %, as induced by equations (7.5), (7.6) and (7.7), vanishes 
for 6 {py} ={O} on BX OG, if and only if da satisfies 
W,,, Ox", = — (0, H) dx* (7733) 
where W;, 1s defined by equation (7.1). 


Examining equation (7.15) in detail, we see that requiring .% to be extremal 
under the condition 6 {y}—fo} on 9* OGJ,, for da=O, results in exactly the 
variation process used in Section III requiring {y} to form an alpha system, 
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(1.e., for da=O we have, by equation (7.8), 6 {y}={n}). We assumed in 
Theorem 7.3 that {y} formed an alpha system and thereby obtained equation 
(7.23) under the hypothesis that the total variation of {y} vanished on G* OQ G,. 
Because of the importance of the content of Theorem 7.3, we state the following 
principle whereby we are assured that {y} will always form an alpha system 
under the variation (7.5) and (7.6). 


The Principle of Extremal Invariance: The quantity %, defined by 


i Se ATED 
Dn 


n? 


where 
L= sly} aeV{y} — Ay, x), 


shall remain extremal for all variations (7.5) and (7.6) such that (1) the total variation, 
O{w}, of {yp} vanishes on GX © G, and (2) equation (7.23) is satisfied. 

Under the principle of extremal invariance we have, by equation (7.15), 
that {yp} satisfies 


V fy} ea + H gy = [0] (7.24) 


and hence forms an alpha system. 


Given a solution, say {yp (a)}, of equations (7.24), we may use equation 
(7.23) to determine sets of functions 6x§(a#). These functions may be inter- 
preted as elements of a group of infinitesimal transformations of the underlying 
space &,. Under the principle of extremal invariance we see that such a group 
of transformations preserves equations (7.24), and conversely, preserves the 
extremal character of % by Theorem 7.3. Since the W;; and the H appearing in 
equation (7.23) depend on the {y} and hence on the alpha system considered, 
we may conclude further that the class of infinitesimal transformations which 
6, can admit under the principle of extremal invariance is determined by the 
alpha system defined over &,. We have thus proved 


Theorem 7.4. The class of infinitesimal transformations which &, can adnut 
under the principle of extremal invariance 1s determined by the alpha system defined 
over &,,, in the sense of equation (7.23). 

We made detailed note in Section I that we do not assume a differential 
geometry for &, but only require &, to be a Hausdorff space. By Theorem 7.4, 
we see that the introduction of the Principle of Extremal Invariance restricts 
the class of infinitesimal transformations which &, can admit in terms of the 
alpha system defined over &,. Since the infinitesimal transformations admitted 
by a space determine the transformation groups which a space admits, and the 
transformation groups in turn determine the nature and type of differential 
geometry which may be introduced into a space, we see that the alpha system 
defined over &, determines a natural type of differential geometry which 
may be introduced into that &, under the principle of extremal invariance. 
In this sense, equation (7.23) is analogous to the Killing equations?!. This is 
particularly evident if H does not depend explicitly on «. 


21 KILLING, W.: J. fiir die Reine und Angew. Math. 109, 121— 186 (1892). 


4D DoMINIC GARDINER BOWLING EDELEN: 


Section VIII. Integrability Conditions and Related Topics 

The discussion in Section II, following the first definition of a canonical 
map, pointed out that one need consider only those bases functions for which 
there exist (p,q) which satisfy equations (2.1) or their matrix form, equation 
(2.12). We take up in this section the question of the form that a base function 
may take in order that there exist ~’s and q’s such that they satisfy equations 
(2.4.2). 

Decomposition of Hanultonian Systems 
f (pf, q) form a Hamiltonian system base H, then by equations (2.12) 


Te /§R\ = —{H ,}. (8.1) 
Applying Theorem 1.2, we may solve these equations for VERY, giving 
VERS = — Tf ae HIG (8.2) 


where ros is an arbitrary column vectrix which satisfies the condition le {C}= {0}. 
Equation (8.2) is the most general explicit decomposition of Hamiltonian systems 
into systems in which each partial derivative of each dependent variable appears 
separately. 


Let {R$} be any solution to equation (8.1); then we can solve for a fC} by 
equation (8.2). On the other hand, let {Ry} be a solution of equation (8.2) for 


a particular fC}; then {R,} satisfies equation (8.1), as is evident by inner multi- 


plication of equation (8.2) by I from the left. Thus, every solution of equation 
(8.2) is a solution of equation (8.1), and for every solution of equation (8.1) we 


can solve for fC} so that it is a solution of equation (8.2). In this sense equations 
(8.1) and (8.2) may be considered similar relative to their collections of solutions. 
More important, we see that 7/ we prove the existence of solutions of equation (8.2) 


for any one fC}, then we have proved the existence of solutions of equation (8.1). 
The fC} appearing in equation (8.2) is subject only to the conditions TefC}= {0}. 
Using the defined form of Tand writing 


{C, ois 
{G3 = 6; {C;} ; {C;} = (Cari 
Shere 


we have Ie £Cy = for if and only if the conditions 


hold. Substituting this result into equation (8.2) gives 
{dah {OF 

"4 thas == Bey fe {H ri te; Cart l 
fb.) n tee iS 
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from which we see that V£q,} is determined uniquely, while V hp, »p> are deter- 
mined only to within the arbitrary functions ¢;{C,,;+ which satisfy equations 


(8.3). This situation is not a surprising one since the ,; satisfy the divergence 
type equations 


(Lee a He 
if the system is Hamiltonian. 


Expanding equation (8.2) in terms of the components of the various vectrices 
appearing gives 


{Ry SL as iCy 


where 
I =8EH,  {C}= 8 fC}. 
Let F(R, x) be any given function of R and & of class C1; then 
F =F gry {R} + OF 
=— Pep Tie + Foy {C3 OF 
= — F(A, PF) +P ay {Ci} 4 OF 
by equation (4.22), (&(H, F) is the 7" component of the right-hand bracket 


vectrix formed from I;* and &,(H, F)=—¥/(F, H)). Thus we have proved 
the following 


*Theorem 8.1. Let F(R, x) be an arbitrary given function of « and {R} of 
class C1, and let {R} form a Hamiltonian system base H; then 


F y= &{(F,H) + {CY {Fh + OF (8.4) 


where 1, f C3 - fo}. 


The Classic Integrability Theorem 


We now turn to the explicit problem of integrability of equations (8.1). From 
their equivalent form, as given by equations (8.2), we have reduced the problem 
of integrability to one taken up in the literature, particularly by EISENHART”. 
For purposes of clarity, we include a brief synopsis of the principal results. 


Consider a system of partial differential equations 
(a) 


where the y¥ are analytic functions of the w* and the x’. These equations are 
equivalent to the system of total differential equations 


0 * 


agi Yall *), CEN clay Ue bE yn 


(b) dut = prtdx'. 


The conditions of integrability of equations (a) considered in terms of equations 
(b) are 
Owe owt Oy Owe 
(c) 7 - : y= i 4 i. 
x1 0b Ont Om? 


22 EISENHART, L. P.: Non-Riemannian Geometry. Amer. Math. Soc. Colloquium 
Publ. No. 8. 
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If these equations are satisfied identically, the system (a) is said to be completely 
integrable. In this case, the solution is expressible in the form 


Op 
Oxt | 


(d) i Cee 


ea -hee 
0 


a 
where (35) 


ne 


= y%(C, x) and the other coefficients are obtained by differentiation 
0 


of equation (a) and evaluating by equating the x’s to the x's and the w’s to the 
C’s. Thus, for the domain of the x’s in which the series (d) converges, we have 
a solution of equations (a) determined by m constants. We denote such a solu- 
tion by 

(e) CSE GO. Fe Cee 


If equations (c) are not satisfied identically, we have a set /, of equations 
which establish conditions upon the v’s as functions of the x’s. If we differentiate 
each of these equations with respect to the x’s and substitute for @u%/@x' from 
equations (a), either the resulting equations are a consequence of the set /, 
or we get a new set fy. Proceeding in this way, we get a sequence of sets of 
equations F,, fy, fy, ..., which must be algebraically compatible if equations (a) 
are to admit a solution. If one of these sets is not a consequence of the preceding 
sets, it introduces at least one additional condition. Consequently, if the equa- 
tions (a) are to admit a solution, there must be a positive number U such that 
the equations of the (U +1)" set are satisfied identically because of the equations 
of the preceding U sets; otherwise we should obtain more than m independent 
equations which would imply a relation between the x’s. Moreover, from this 
argument it follows that Um. 

Conversely, suppose that there is a number U such that the equations of 
the sets 


(f) enh nee 
are compatible and each set introduces one or more conditions independent of 


the conditions imposed by the equations of the other sets, and that all of the 
equations of the set 


(g) se, 
are satisfied identically because of the equations of the sets (f). Assume that 
there are ~ (<m) independent conditions imposed by (f), say 
G, (Uys) = 0; VP SAEED: 


Since the Jacobian matrix (0G,/0u*) is of rank p, these equations may be solved 
for p of the w’s in terms of the remaining w’s and the x’s, which, by suitable 
ordering, may be written as 

(h) Pit LC a an are y=W Meta 
From these equations we have by differentiation 


Ome Og? OW 0g? 
Ox BaP aIean ee Y = Pes a 
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Replacing 0°/@x* by means of equations (a) gives 


oe OP go OP phe sett 
(i) Ce ae a SP 


which are satisfied because of the sets (f) and (g), as follows from the method 
of obtaining the latter. Accordingly, we have by subtraction 


. Om? 0g? Ow” ; 
(i) Tan Pan ie ae MO C= Ee SP, Ne ooo 


From these equations it follows that if the functions u?*1, ..., w’" are chosen to 
satisfy the equations 


(k) ae ah ise Pan dee y=P+1,...,m 


where the # are obtained from the y} on replacing w° (o=1,..., f) by their 
expressions (h), then equations (a) for «.=1,..., are satisfied by the values 
given by equations (h). Since the equations of the set & are satisfied identi- 
cally because of equation (h), it follows that equations (k) are completely inte- 
grable, for the equations arising from expressing their conditions of integrability 
are in the set because of equations (1). Consequently, there is a solution in this 
case, and it involves m —?# arbitrary constants. When =m, we have in place 
of (h) u*—g*(x), and in place of equations (j) the functions wu satisfying equa- 
tions (a). Thus we have 

Theorem A. In order that a system of equations (a) admit a solution, it is 
necessary and sufficient that there exist a positive integer U (Sm) such that the 
equations I,, fy, ..., Fy are compatible for all values of the x’s in a domain, and 
that the equations of the set Fy. are satisfied identically because of the former sets; 
if p is the number of independent equations in the first U sets, the solution involves 
m — p arbitrary constants. 


Integrability and Generalized Hanultonian Systems 


Since equations (8.2) are of the form of equations (a) in the above discussion, 
and we have seen that if we demonstrate the existence of solutions to equations 


(8.2) for at least one fC}, then we insure the existence of solutions of equations 
(8.1). We may apply the results of the above theorem directly. 
Definition. Equations (8.1) are said to be completely integrable if there 


exists at least one column vectrix £C}, satisfying the condition Te {O}= 0, such 
that equations (8.2) are completely integrable. 


Note. Since different solutions of equations (8.1) may correspond to different 


choices of £0}, complete integrability in the sense of the above definition does 
not imply that all solutions of equations (8.1) constitute an S-parameter family 
of solutions where S is finite. However, to every choice of {C} there does exist 
an N(n-+1)-parameter sub-family of solutions to equations (8.1) if these equa- 
tions are completely integrable. 
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*Theorem 8.2. Equations (8.1) are completely integrable for a given H EH af 


and only if there exists at least one column vectrix {C } such that 
To{C}= {0} 
Bi LIER}, H), H) — BGR}, H), H) + 
+ Fi({C}.H) — BEC} H) + 
+ Zi (ERY, A), cry {Ci — GRY, 2), cx {CB + (8.5) 
+ {Ci} cry {Cb — {Ci crn {Cid + 9; FERS, A) — 
— 8, ({R}, A) + {Cid + A{Ci$ = {OF 


is identically satisfied in {R} and x. 


and 


Proof. From Theorem 8.1 we have 


{RY = — FH (H, 4RY) + {CG}, 
= F(R}, H) +{C}. 


so that the result follows from equation (c) above by use of equation (8.4). 

Starting with equations (8.5) as the system / and using Theorem 8.1 to 
generate the systems Fy,..., we have by Theorem A, upon noting that m in 
that discussion must be replaced by N(n-+-1), 


*Theorem 8.3. In order for equations (8.1) to admit a solution, it is both necessary 
and sufficient that there exist a positive integer Us N (n+ 1)) such that the bases 


function H and the column vectrix EC, defined by equation (8.2), be such that the 
equations F,, Fy, Fy,... ave compatible for all values of the x’s in the domain under 
consideration, and that the equations of the set Fy.., are identically satisfied because 
of the former sets. If S is the number of independent equations of the first U sets, 
then the solution will involve N(n+1)—S arbitrary constants. 
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Scalar Diffraction Theory and Turning-Point Problems 


N. D. KAZARINOFF & R. K. RItTT 


Communicated by L. CESARI 


In this paper we wish to discuss and prove a fundamental theorem [15] 
in the theory of scalar diffraction and to motivate and describe a number of 
turning-point problems in the theory of ordinary differential equations, some 
of which are as yet unsolved. Let B be a smooth, simple, closed surface in E3, 
and suppose that B is steadily illuminated by mono-chromatic electromagnetic 
(or sound) waves from a source distribution @(x). It is of importance to know 
the field surrounding 6; in particular, one would like to know what percent 
of the incident energy is returned in a given direction, what percent is scattered 
in a forward direction, what are the surface currents, and how all these things 
depend upon 6 and the wavelength 4 of the incident radiation. This is an 
immensely difficult problem. Heuristic mathematical-physical theories in good 
agreement with observed phenomena have been found in certain cases, notably 
by V. A. Fock [2] and J. B. KELLER [10]; but a rigorous mathematical treatment 
has been given only for special shapes and wavelengths, for example, if 4 is 
small with respect to the dimensions of B and B is a prolate spheroid or elliptic 
cylinder of small eccentricity, a right circular cone, or a parabolic cylinder. 


Fortunately, there is a relation between the electromagnetic problem de- 
scribed above and the acoustical or scalar one. This connection is described 
in [10, 18]. It enables one to solve the electromagnetic problem for a particular 
body B, or at least to determine part of the solution, by solving the scalar problem. 
For the remainder of this paper we shall confine our remarks to the scalar problem 
(I) defined as follows. Let /, g, and @ be functions, having compact support, 
which are defined on the closed exterior of B, henceforth denoted by V. Let / 
and g be in C®)(V), and let the normal derivatives 0//0n and 0g/dén be zero 
on B. We postulate the existence of a function u(x, t) which is in C®(V) for 
each fixed positive ¢, which is in C®)(£>0), and which satisfies the following 
boundary-initial value problem: 


V2u —m%,= 0(x)e® for t>0; 
(I) dulon =0 on b; 
u(x,0*)=f(x), (x, 0°) =g(X), 


the convergence being uniform. 
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Existence theorems for problem (I) have been given by LADYZENSKAYA & 
ViSIK [7]. They are primarily concerned with the existence of weak solutions. 
They then show several ways how these weak solutions may be proved to be 
strong ones when the data are sufficiently regular. The work of Lewis [//] is 
much more severely restricted — B must be a hyperplane. Of course, the presence 
of initial conditions prevents separation of ¢ in (I); and at the present time it 
is unknown whether or not there exists a steady “state” solution of the form 


u(x,t) = u(x) ef?! + u*(x,t), 
where u*—>0 as t->oo. The theorem we shall prove asserts that the C —1 limit 
of u* is zero, however. This generalizes a result of LADYZENSKAYA [7; p. 96]. 
We first observe 


Lemma 1. The solution of (1) is unique. If f,g, and o vanish for |x| >R, 
then u(x, t)=0 for |x| >R+4. 

The proof of Lemma 1, under less restrictive hypotheses, is contained in 
[18; p. 135]. We now state the theorem we shall prove. 


Theorem. Let v(x, t)=e7'®'u(x, t), where u(x, t) ts the solution of (1). Then, 
T 
a) lim + fol, t) dt = v*(x) - extsts; 


T-0o 
0 


D2 OA) o"(2y = 0 (ae Peis iom = Onto BD. 
v* satisfies the Sommerfeld radiation condition; 
c) The Laplace transform, d(x, s), of v(x, t) exists for s>0 and lies in L® (V); 
ie ai 
d) jim s9(x, S) =U ae 
e) In the computation of the limit in (d), sv (x, s) may be replaced by the function 
P(x, s) which satisfies the equation 
(1) [V2 + (@ — 4s)?] ®(x, s) = o(x) 
and the conditions that 0®/6n=0 on B and @ lie in L®(V). 
Our proof requires several lemmas. 


Lemma 2. If u(x,t) is the solution of (1) with o(x)=0 (the homogeneous 
problem) and if v(x, t)=u(x, t) exp(—iwd), then the L® norms of v and v, are 
bounded functions of t. 


Proof. The uniqueness theorem (Lemma 1) permits one to express the column 


vector col(u, u,) in the form 
iu 
Uy; § 


where .}(f) is a strongly continuous semigroup on the space X= X, x Xy, Xp being 
the space of functions in C®(V) having compact support and having normal 
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derivatives which vanish on B. The topology of X, is that of uniform convergence 
of all derivatives of order less than or equal to two. By a slight modification 
of a theorem of HILLE’s [3; p. 396], one can express 9(¢) as 


Cw 


(4) = exp (nd), 


OA 
T= \i 73 Bo 


whose domain is dense in the completion of X. It is then an elementary compu- 
tation to show that col(v, v,) can be expressed in the form 


s)-one() 
U; § 


where m is described by the matrix 


where n is the closed_operator 


yee 1 
V2 a? 9-2 27.4) ) 
and where /* and g* are in Xp. 


Now let us regard col(/*, g*) as an element, not of X, but of Y=, x Yo, 
where Y),—L®(V). We seek the resolvent (m— A) in Y). To find this resolvent, 


51 82 


which is equivalent to the system 
—Af=hs 
[V? +o] f,— (200 +A) p= (21M +A A+R. 


If we eliminate g, from these equations, we obtain the equation 
[V? + (wo —t4)*) f= (20 +A A+ &. 


Now, the operator — V?, applied to those functions in ¥)y whose normal derivatives 
vanish on B, has a representation 


= =u dias 
where E,, is a resolution of the identity. But then /, is given by the formula, 
Bae Fl (@ —¢)?—p]*dE, {21m + a) A+ 8}. 
If A is positive, the norm of the operator 


Silo ai tab, 
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is clearly bounded by (2wA)+; and hence, the L® norms of f, and g,, for small 
positive 4, are O(1/A). This shows that the L®) norms of v and v, are bounded 


[3; p. 240]. 
Lemma 3. If u(x,t) is the solution of the homogeneous problem and v(x, t)= 


u(x,t)exp(—tat), then . 
w(x, T) = f v(x, t) dt= O(T*) 


0 
uniformly for xin V. 


Proof. It is easy to verify that v(x, ¢) satisfies the equation 


(V2 + w?) v(x, t) — 21m v,— Y4,,=0. 
Thus, 
(V2 + w?) w(x, T) =[21@ v(x, t) + 0,(%,t)]§ =h(x, T). 


By Lemma 1, both w(x, T) and h(x, T) have their supports in a sphere of radius 
R+T, and the L® norm of h(x, T) is bounded. Because w(x, T) has compact 
support, it satisfies a prior: the Sommerfeld radiation condition. Moreover, there 
exists a Green’s function G(x, x’) of the form [17] 

tw|x—x'| 


Ga Vee 


Jy a'| 


+ G*(x, x’) 


for the exterior problem, where G*(x, x’) has no singularities and is @(1/|x|) 
for large |x|; and G(x, x’) is such that 


w(x, T) = ill Tm polation T)dx'. 


If |x| >R+T, w(x, T) vanishes; and hence we may assume that |x| <R+T. 
Let us compute the first integral over the intersection of V with a sphere with 
center at x and radius 2(R-+ T), and let us compute the second integral over the 
intersection of V and a sphere with center at the origin and radius R+ T. Since 
both these spheres contain the support of h(x, 7), this replacement is permissible. 
But then, by using the Cauchy-Buniakovsky-Schwarz inequality, we find that 
both integrals are ©(T+), which completes the proof. 


We are indebted to Professor Carvin Wicox for a simplification in the 
proof of this lemma. 


Proof of the Theorem. We begin by representing col(v, v,), now determined 
by the inhomogeneous problem (I), by the variation of parameters formula 


t 
(CJeen(E + fe 0 Jas 
UY; g : — 0 


If we integrate both members of this equation with respect to IT, we find that 


1 fie aa ies : E 0 
=| Gee! sta! (e—plene( Olas. 


0 
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The upper component in the first integral of the right-hand member is 0(T-3) 
by Lemma 3. It therefore converges to zero uniformly as Too. Consequently, 
we lose no generality in assuming that f*=g*=0. Let W(x, T) be the upper 
component of the left-hand member of the above equation, and let w(x, &) be 
the upper component of exp(m &) col(0, —e). In particular, 

(V25- 0") W476) = 210 Wa wee OP an (se, OF) ==0,  w,(x, 0°) = =o. 


s 


Hence, 


and 
Be 
(V2 + w?) W(x, T) = 4b (1 —F.)[2i co wy + we_] 48, 
0 Tr 
=0(x)+ sid fo(x4)db+ ols, Ti), 


0 


If G(x, x’) is the Green’s function introduced in the proof of Lemma 3, 


eeiiaea dx erp le Sltad free é)dédx! + 
ra fffowns (x!, T) dx’ 


As in Lemma 3, the integrals on the right-hand side of the above equation which 
involve JT are seen to converge to zero as Too. Thus, the existence of 


Ted ea jim W(x, T) = MTG 2) 0 eva 


has been shown. This proves (a) and (b). 
A standard Abelian theorem [Z; p. 193] and the observation that 


Nata ee dle D(x te dt 


0 |a|—R 


imply (c) and (d). The proof of (e) rests on first observing, as we have, that 
the initial conditions can be ignored and then on obtaining the partial differential 
equation for d(x, s) from that satisfied by v(x, ¢) by means of the usual methods 
of Laplace transform theory. This completes the proof. 


Henceforward we shall consider only the reduced problem described in part (b) 
of the Theorem. Physically, this is not an unreasonable thing to do, since one 
can measure only the time average of a varying quantity, not its instantaneous 
values. It turns out that it is easiest to solve the reduced problem for v*(x) 
by determining the solution ®(x, s) of (1) and then letting s—0*. In order to 
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proceed further, we make the inevitable assumption that B is a level surface, 
£=é,, in a coordinate system (&, 7, p) in which the operator VP hares) is 
separable. For simplicity, we assume further that B is symmetric in @ so as 
to reduce the number of variables which need be considered to two. Under 
these assumptions, then, equation (1) may be written in the form 


— (L,+L,) ®= J(&,n) 0(§,n) 
with 
a® (&),)/0F =0 


and where the operators L; and L, depend upon & and 4 alone, respectively. 


The operator L; — is not self-adjoint so that classical Sturm-Liouville theory 
cannot be used to find a representation for ®. However, Sims [16] and PHILLIps 
[14] have provided a resolvent theory for operators of the same type as L;— iC 


and L,—y. Their work guarantees that there exist resolvents #, and k, for 
L,—y and L,—», respectively, with the property that for certain paths Tin 
the w-plane, 

I BEY SEU OSS 


(—co</l<oo, c a positive constant depending on w and the dimensions of 8B), 


En s)=51, | RA, (Te) du. 
yi 


A path I’ separates the poles of Ray and &,. If @ reduces to a point source at 
(£, t) in the (&, 7)-plane (a line source in cylinder problems), this integral simplifies 
and becomes 
1 
DoD 
B he 


Orie tS) == 


[ Co, T= Pp) GE 57h) d ie, 


where G and G are the resolvent Green’s function for the operators L; and L,, 
respectively. 


One may attempt to evaluate this integral as a residue series by considering 


either the poles of G or those of G. The expansion involving the poles of the 
angular Green’s function is a classical one, often called the Mie series expansion ; 
but it converges so slowly when 4 is small as to be almost useless then. The 
expansion involving the poles of the radial Green’s function corresponds to the 
use of the Watson transform, familiar in right circular cylinder and sphere 
problems. However, except in such simple cases the proper formulation of the 
Watson transform had not been found prior to our approach. In cases which 
have been studied thus far, the radial residue series is rapidly convergent, when 
A is small for points (&, 7) in the geometric shadow of B. 


It is in determining the terms of the radial residue series, particularly in 


locating the poles of G, that turning-point problems arise. To see this let us 
consider some specific examples. 
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A) The elliptic cylinder. Let B be an elliptic cylinder with semi-major axis a, 
semi-minor axis }, and eccentricity e=sech &. In this case, 


eS gee 


faa + y* sinh? & 


and 


Bi ghent hi + y? sin? 
1 dye y sin" 7, 


where y=ae(w —1s). Then if O0<Qu<2m6?s, if y, is a solution of (L:—u) y=0 
such that 4; (&))=0, and if yp is a solution in L®(&, 0), 
(2) Ce eee ee 

21 y Vo (Fo. H) V4 () Vo (E), Be 


The poles of G are thus the zeros, in the y-plane, of v3(&q, 11). 


Now y, is an L®)-solution of 


d?y 
a & 


(3) +(y?sinh?Eé+mu)y=O0 (6) S&< oo). 


The conditions that / be small and e be positive mean that equation (3) is to 
be considered for large |y|. Note that as e—1, &—0. If & is bounded from 
zero, then the differential equation (3) has a simple turning point at € when 


= —y?sinh?é,. One can show that for much smaller values of |u|, v2 is not 
oscillatory. Therefore to find the zeros y,, of y3(& , 2) one makes the substitution 
w=—y?sinh?é, and determines & so that yg vanishes. This is done with the 


aid of LANGER’s theory [8] for differential equations with simple turning points. 
The equation which determines yu, is found to be [4] 


(4) +s 4% H® (C) Jens, 5S aise oh 


§ 
Here C=y f (sinh?¢—sinh?,)'dt and H is a Hankel function. It is possible 
b1 
to solve asymptotically the transcendental equation (4) for €, and thereby to 


obtain the functional dependence of ,, on e and /, provided e is bounded away 
from 1. 

On the other hand, if e is so close to 1 that sinh?é)= 0 (y™), then the zeros 
of y3(&), 4) may be located by means of MCKELVEyY’s second order turning point 
theory [12]. In this instance, one lets ~=—yK and considers the equation (3) 
in the neighborhood of =0. The transcendental equation for yw, which corres- 
ponds to (4) is considerably more complicated *. 


The most important problem, however, is to derive the behavior of y,, as 
e->1. To do this one needs to know the behavior of y9(&, w) as a function of the 


* An account of this work will be given in a forthcoming University of Michigan 
Research Institute Report. 
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vanishing 2 between the two confluent turning points of the differential equation 
@Y 4 y2(sinh? € — sinh? A) y =0 
(5) aP + y? (sin — sin \v=0. 


This problem is presently being attacked via the theory in [6], which may be 
modified so as to apply to equation (5). Interest in the problem is pronounced 
because of the fact that as e—>1, A being fixed, slight changes in e produce rela- 
tively large changes in observed fields; that is to say, there is a resonance phe- 
nomenon. Moreover, none of the heuristic theories used by physicists and specialists 
in electromagnetic theory scattering problems are applicable in the resonance 
region. 

Of course, after one has found the yw, in these various situations, one must 
also derive the form of G (7, T, —M@,). This too requires turning point analyses 
of the sort described above. 


B) The prolate spheroid. This case is more complicated than that of the 
elliptic cylinder as it is essentially three-dimensional. The complication is re- 
flected in singularities in the operators L; and L,. Let B be a prolate spheroid 
with semi-major axis a, semi-minor axis 0, and eccentricity e=&". The operators 
L, and L, are then defined by the relations 


INV 


SL Va oP vee? te = te eee ea 
and 


Ly P= AP vel ee le) eed ee 


As before, y=ae(w—is) and |y| is large. If solutions y, and y, of (L:—y) y=0 
are determined as in the elliptic cylinder case, G(&, 5, w) is again defined by (2). 
Thus the problem reduces to one of locating the zeros wu, of y2(&, uw), Where yp 
is a solution of the equation 


(6) (= 4) vel ly (St) el 0 


and ¥, lies in L®(&), co). Equation (6) has both a regular singularity and simple 
turning points. It &) is bounded away from 14, and is bounded, that is, if e is 
bounded away from both zero and 1, then in determining the early w,, one may 
assume that w= 0 (y”) and use LANGER’s theory [8] for a simple turning point. 
The evaluation of G requires knowledge of solutions of (L,, —) y=0 near n= —1. 
Since this is a singular point of the differential equation, a different asymptotic 
scheme of LANGER’Ss [9] is used to evaluate G. This analysis was carried out 
in [4]. If (€—1)=O(y>), then the above mentioned study of (C= n)y=0 
no longer applies. The equation (L; —j) y=0 must be considered in the neigh- 
borhood of its singularity at §=1, and the turning point no longer plays a réle. 


To obtain the asymptotic behavior of solutions of (L:—p)yv=0 near €==4 
one uses MCKELVEy’s theory [13]. This analysis is not nearly as complicated, 
which is natural since it corresponds to the case of B being a needle; and a 
needle disturbs the incident radiation very little. 
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In the region between (&—1)=@(y) and (&—1)=@(1), the resonance 
region, the effect upon y9(&,) of both the turning point and singularity of 
(L:—j)y=0 must be considered. In particular, the problem is to determine 
the functional behavior of y, as these points coalesce. Although the situation 
in the limits is simple, the simple pole and zero cancel so that there is neither 
a turning point nor a singularity in the coefficient of y? to cause difficulty, no 
existing asymptotic theory appears to be adaptable to the solution of this problem. 


The above examples show that the field of diffraction theory is as rich a 
one for the application of the theory of asymptotic solutions of ordinary differ- 
ential equations with turning points as has yet been found. It demands all 
aspects of the theory and points out the need for new extensions. In case B 
is not a level surface of a coordinate system in which the operator V?+ (w — 7s)? 
is separable, neither the generalized Sturm-Liouville theory nor the turning-point 
theory we have discussed here applies. The development of satisfactory theories 
to replace them is perhaps the most challenging and difficult problem in this field. 


Part of this work was done under Contract AF-19(604)-4993, Air Force Cambridge 
Research Center, ARDC. 
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